Overview


Dataset statistics

                               Full Dataset  Stratified Sample
Number of variables            78            78
Number of observations         1000000       30000
Missing cells                  0             0
Missing cells (%)              0.0%          0.0%
Total size in memory           595.1 MiB     17.9 MiB
Average record size in memory  624.0 B       624.0 B
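
As a quick consistency check on the figures above, the total size in memory should be roughly the number of observations times the average record size. The sketch below redoes that arithmetic in plain Python. The helper name `to_mib` is ours for illustration and is not part of ydata-profiling; the inputs are copied from the report.

```python
# Figures copied from the Dataset statistics table of the report.
AVG_RECORD_BYTES = 624.0
ROWS = {"Full Dataset": 1_000_000, "Stratified Sample": 30_000}


def to_mib(n_bytes: float) -> float:
    """Convert a byte count to mebibytes (1 MiB = 1024 * 1024 bytes)."""
    return n_bytes / (1024 * 1024)


# Estimated total size per dataset: rows * average record size.
sizes_mib = {name: to_mib(rows * AVG_RECORD_BYTES) for name, rows in ROWS.items()}

for name, size in sizes_mib.items():
    print(f"{name}: {size:.1f} MiB")
```

Both results round to the totals reported above, which suggests the two memory rows are derived from each other rather than measured independently.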

Variable types

         Full Dataset  Stratified Sample
Numeric  40            40
Text     38            38

Alerts

Full Dataset                                      Stratified Sample                                Alert
customer_id has unique values                     customer_id has unique values                    Unique
membership_years has 99846 (10.0%) zeros          membership_years has 3030 (10.1%) zeros          Zeros
number_of_children has 199753 (20.0%) zeros       number_of_children has 5930 (19.8%) zeros        Zeros
transaction_hour has 41756 (4.2%) zeros           transaction_hour has 1277 (4.3%) zeros           Zeros
avg_discount_used has 10010 (1.0%) zeros          Alert not present in this dataset                Zeros
in_store_purchases has 10016 (1.0%) zeros         in_store_purchases has 321 (1.1%) zeros          Zeros
total_returned_items has 100060 (10.0%) zeros     total_returned_items has 3026 (10.1%) zeros      Zeros
product_stock has 10174 (1.0%) zeros              product_stock has 317 (1.1%) zeros               Zeros
customer_support_calls has 49755 (5.0%) zeros     customer_support_calls has 1525 (5.1%) zeros     Zeros
website_visits has 10111 (1.0%) zeros             Alert not present in this dataset                Zeros

Reproduction

                        Full Dataset                Stratified Sample
Analysis started        2025-06-06 02:26:31.335908  2025-06-06 02:28:30.874457
Analysis finished       2025-06-06 02:28:30.840000  2025-06-06 02:28:35.688336
Duration                1 minute and 59.5 seconds   4.81 seconds
Software version        ydata-profiling v4.16.1     ydata-profiling v4.16.1
Download configuration  config.json                 config.json
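
The reported durations follow directly from the start and finish timestamps. The sketch below recomputes them with the standard library only; the dictionary layout and variable names are ours, and the timestamps are copied from the report.

```python
from datetime import datetime

FMT = "%Y-%m-%d %H:%M:%S.%f"

# (started, finished) timestamps copied from the Reproduction section.
RUNS = {
    "Full Dataset": ("2025-06-06 02:26:31.335908", "2025-06-06 02:28:30.840000"),
    "Stratified Sample": ("2025-06-06 02:28:30.874457", "2025-06-06 02:28:35.688336"),
}

# Elapsed wall-clock time per run, in seconds.
durations = {
    name: (datetime.strptime(end, FMT) - datetime.strptime(start, FMT)).total_seconds()
    for name, (start, end) in RUNS.items()
}

for name, seconds in durations.items():
    minutes, rest = divmod(seconds, 60)
    print(f"{name}: {int(minutes)} min {rest:.2f} s")
```

The timestamps also show that profiling of the sample started about 34 ms after profiling of the full dataset finished, so the two analyses ran back to back.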

Variables

customer_id
Real number (ℝ)

              Full Dataset  Stratified Sample
Distinct      1000000       30000
Distinct (%)  100.0%        100.0%
Missing       0             0
Missing (%)   0.0%          0.0%
Infinite      0             0
Infinite (%)  0.0%          0.0%
Mean          500000.5      500803.1024
Minimum       1             12
Maximum       1000000       999880
Zeros         0             0
Zeros (%)     0.0%          0.0%
Negative      0             0
Negative (%)  0.0%          0.0%
Memory size   7.6 MiB       234.5 KiB

Quantile statistics

                           Full Dataset  Stratified Sample
Minimum                    1             12
5-th percentile            50000.95      51637.35
Q1                         250000.75     249894
median                     500000.5      502067.5
Q3                         750000.25     752820.25
95-th percentile           950000.05     947160.85
Maximum                    1000000       999880
Range                      999999        999868
Interquartile range (IQR)  499999.5      502926.25
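
The fractional quantiles for the full dataset (for example 50000.95 for the 5-th percentile) are what linear interpolation between neighbouring ranks produces, which is also the default behaviour of pandas' `quantile`. The sketch below is a minimal illustration under one assumption that the report suggests but does not state outright: that `customer_id` in the full dataset takes each integer from 1 to 1,000,000 exactly once. The function `quantile` is our own helper, not a ydata-profiling API.

```python
import math
from typing import Sequence


def quantile(sorted_values: Sequence[float], q: float) -> float:
    """Quantile of pre-sorted data using linear interpolation between ranks."""
    pos = q * (len(sorted_values) - 1)  # fractional 0-based rank
    lo = math.floor(pos)
    hi = min(lo + 1, len(sorted_values) - 1)
    frac = pos - lo
    return sorted_values[lo] + frac * (sorted_values[hi] - sorted_values[lo])


# Assumption: the full dataset's customer_id values are exactly 1..1,000,000.
ids = range(1, 1_000_001)

q05 = quantile(ids, 0.05)
q1 = quantile(ids, 0.25)
median = quantile(ids, 0.50)
q3 = quantile(ids, 0.75)
iqr = q3 - q1

print(q05, q1, median, q3, iqr)
```

Under that assumption the computed values agree with the Full Dataset column above, up to floating-point rounding.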

Descriptive statistics

                                 Full Dataset            Stratified Sample
Standard deviation               288675.2789             288853.064
Coefficient of variation (CV)    0.5773499805            0.5767797018
Kurtosis                         -1.2                    -1.208484499
Mean                             500000.5                500803.1024
Median Absolute Deviation (MAD)  250000                  251514
Skewness                         -2.511790261 × 10^-15   -0.008815474411
Sum                              5.000005 × 10^11        1.502409307 × 10^10
Variance                         8.333341667 × 10^10     8.343609261 × 10^10
Monotonicity                     Strictly increasing     Not monotonic
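
Several of the full-dataset statistics for `customer_id` have closed forms if the column is simply the integers 1..N, which the distinct count, minimum, maximum and strict monotonicity all point to. The sketch below treats that as an assumption and recomputes the sum, mean, sample variance, standard deviation and coefficient of variation from the formulas for consecutive integers. All variable names are ours.

```python
import math

N = 1_000_000  # assumption: customer_id takes each integer 1..N exactly once

total = N * (N + 1) // 2     # sum of 1..N
mean = total / N             # equals (N + 1) / 2
variance = N * (N + 1) / 12  # sample variance (denominator N - 1)
std = math.sqrt(variance)
cv = std / mean              # coefficient of variation

print(total, mean, variance, std, cv)
```

The results line up with the Full Dataset column above to the precision shown there, so the column behaves like a plain row index rather than a measured quantity.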
Histogram with fixed size bins (bins=50)
Full Dataset                                   Stratified Sample
Value                  Count   Frequency (%)   Value                 Count  Frequency (%)
999984                 1       < 0.1%          162060                1      < 0.1%
999983                 1       < 0.1%          962145                1      < 0.1%
999982                 1       < 0.1%          988334                1      < 0.1%
999981                 1       < 0.1%          703785                1      < 0.1%
999980                 1       < 0.1%          729865                1      < 0.1%
999979                 1       < 0.1%          944983                1      < 0.1%
999978                 1       < 0.1%          259621                1      < 0.1%
999977                 1       < 0.1%          507975                1      < 0.1%
999976                 1       < 0.1%          5872                  1      < 0.1%
999975                 1       < 0.1%          403580                1      < 0.1%
Other values (999990)  999990  > 99.9%         Other values (29990)  29990  > 99.9%

Full Dataset                   Stratified Sample
Value  Count  Frequency (%)    Value  Count  Frequency (%)
1      1      < 0.1%           12     1      < 0.1%
2      1      < 0.1%           18     1      < 0.1%
3      1      < 0.1%           34     1      < 0.1%
4      1      < 0.1%           125    1      < 0.1%
5      1      < 0.1%           147    1      < 0.1%

age
Real number (ℝ)

              Full Dataset  Stratified Sample
Distinct      62            62
Distinct (%)  < 0.1%        0.2%
Missing       0             0
Missing (%)   0.0%          0.0%
Infinite      0             0
Infinite (%)  0.0%          0.0%
Mean          48.496605     48.33273333
Minimum       18            18
Maximum       79            79
Zeros         0             0
Zeros (%)     0.0%          0.0%
Negative      0             0
Negative (%)  0.0%          0.0%
Memory size   7.6 MiB       234.5 KiB

Quantile statistics

                           Full Dataset  Stratified Sample
Minimum                    18            18
5-th percentile            21            21
Q1                         33            33
median                     49            48
Q3                         64            64
95-th percentile           76            76
Maximum                    79            79
Range                      61            61
Interquartile range (IQR)  31            31

Descriptive statistics

                                 Full Dataset      Stratified Sample
Standard deviation               17.87438116       17.89495728
Coefficient of variation (CV)    0.3685697414      0.3702450917
Kurtosis                         -1.198117884      -1.195486737
Mean                             48.496605         48.33273333
Median Absolute Deviation (MAD)  15                15
Skewness                         -0.0002769945754  0.009864896331
Sum                              48496605          1449982
Variance                         319.493502        320.2294962
Monotonicity                     Not monotonic     Not monotonic
Histogram with fixed size bins (bins=50)
Full Dataset                              Stratified Sample
Value              Count   Frequency (%)  Value              Count  Frequency (%)
53                 16423   1.6%           20                 539    1.8%
54                 16412   1.6%           78                 536    1.8%
33                 16407   1.6%           36                 533    1.8%
36                 16363   1.6%           62                 527    1.8%
62                 16324   1.6%           37                 523    1.7%
39                 16290   1.6%           57                 516    1.7%
34                 16284   1.6%           25                 516    1.7%
40                 16274   1.6%           30                 514    1.7%
32                 16264   1.6%           58                 514    1.7%
19                 16248   1.6%           21                 508    1.7%
Other values (52)  836711  83.7%          Other values (52)  24774  82.6%

Full Dataset                   Stratified Sample
Value  Count  Frequency (%)    Value  Count  Frequency (%)
18     16003  1.6%             18     465    1.6%
19     16248  1.6%             19     490    1.6%
20     16116  1.6%             20     539    1.8%
21     16016  1.6%             21     508    1.7%
22     16211  1.6%             22     500    1.7%

gender
Text

              Full Dataset  Stratified Sample
Distinct      3             3
Distinct (%)  < 0.1%        < 0.1%
Missing       0             0
Missing (%)   0.0%          0.0%
Memory size   7.6 MiB       234.5 KiB

Length

               Full Dataset  Stratified Sample
Max length     6             6
Median length  5             5
Mean length    5.001174      4.9948
Min length     4             4

Characters and Unicode

                     Full Dataset  Stratified Sample
Total characters     5001174       149844
Distinct characters  10            10
Distinct categories  1             1
Distinct scripts     1             1
Distinct blocks      1             1

The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

            Full Dataset  Stratified Sample
Unique      0             0
Unique (%)  0.0%          0.0%

Sample

         Full Dataset  Stratified Sample
1st row  Other         Other
2nd row  Female        Female
3rd row  Female        Other
4th row  Female        Male
5th row  Female        Female

Full Dataset                    Stratified Sample
Value   Count   Frequency (%)   Value   Count  Frequency (%)
other   333734  33.4%           male    10085  33.6%
female  333720  33.4%           other   9986   33.3%
male    332546  33.3%           female  9929   33.1%

Most occurring characters

Full Dataset                    Stratified Sample
Value  Count    Frequency (%)   Value  Count  Frequency (%)
e      1333720  26.7%           e      39929  26.6%
a      666266   13.3%           a      20014  13.4%
l      666266   13.3%           l      20014  13.4%
O      333734   6.7%            M      10085  6.7%
t      333734   6.7%            O      9986   6.7%
h      333734   6.7%            t      9986   6.7%
r      333734   6.7%            h      9986   6.7%
F      333720   6.7%            r      9986   6.7%
m      333720   6.7%            F      9929   6.6%
M      332546   6.6%            m      9929   6.6%
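
The length and character figures for `gender` can be cross-checked from the three category counts alone, since every row holds one of three known strings. The sketch below derives the total characters, mean length and per-character counts for the full dataset. The counts come from the report; the capitalised spellings are taken from the sample rows, and the variable names are ours.

```python
from collections import Counter

# Full-dataset category counts from the report; spellings as in the sample rows.
value_counts = {"Other": 333_734, "Female": 333_720, "Male": 332_546}

n_rows = sum(value_counts.values())
total_chars = sum(len(value) * count for value, count in value_counts.items())
mean_length = total_chars / n_rows

# Each character in a category string occurs once per row holding that category.
char_counts = Counter()
for value, count in value_counts.items():
    for ch in value:
        char_counts[ch] += count

print(n_rows, total_chars, mean_length)
print(char_counts.most_common(3))
```

The letter "e" dominates because "Female" contributes it twice per row while "Male" and "Other" contribute it once each.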

Most occurring categories

ValueCountFrequency (%)
(unknown) 5001174
100.0%
ValueCountFrequency (%)
(unknown) 149844
100.0%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 5001174
100.0%
ValueCountFrequency (%)
(unknown) 149844
100.0%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 5001174
100.0%
ValueCountFrequency (%)
(unknown) 149844
100.0%

income_bracket
Text

              Full Dataset  Stratified Sample
Distinct      3             3
Distinct (%)  < 0.1%        < 0.1%
Missing       0             0
Missing (%)   0.0%          0.0%
Memory size   7.6 MiB       234.5 KiB

Length

               Full Dataset  Stratified Sample
Max length     6             6
Median length  4             4
Mean length    4.333713      4.3337
Min length     3             3

Characters and Unicode

                     Full Dataset  Stratified Sample
Total characters     4333713       130011
Distinct characters  12            12
Distinct categories  1             1
Distinct scripts     1             1
Distinct blocks      1             1

The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

            Full Dataset  Stratified Sample
Unique      0             0
Unique (%)  0.0%          0.0%

Sample

         Full Dataset  Stratified Sample
1st row  High          High
2nd row  Medium        Low
3rd row  Low           Low
4th row  Low           Medium
5th row  Low           High

Full Dataset                    Stratified Sample
Value   Count   Frequency (%)   Value   Count  Frequency (%)
high    333612  33.4%           high    10008  33.4%
medium  333367  33.3%           medium  10001  33.3%
low     333021  33.3%           low     9991   33.3%

Most occurring characters

Full Dataset                              Stratified Sample
Value             Count   Frequency (%)   Value             Count  Frequency (%)
i                 666979  15.4%           i                 20009  15.4%
H                 333612  7.7%            H                 10008  7.7%
g                 333612  7.7%            g                 10008  7.7%
h                 333612  7.7%            h                 10008  7.7%
M                 333367  7.7%            M                 10001  7.7%
e                 333367  7.7%            e                 10001  7.7%
d                 333367  7.7%            d                 10001  7.7%
u                 333367  7.7%            u                 10001  7.7%
m                 333367  7.7%            m                 10001  7.7%
L                 333021  7.7%            L                 9991   7.7%
Other values (2)  666042  15.4%           Other values (2)  19982  15.4%

Most occurring categories

ValueCountFrequency (%)
(unknown) 4333713
100.0%
ValueCountFrequency (%)
(unknown) 130011
100.0%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 4333713
100.0%
ValueCountFrequency (%)
(unknown) 130011
100.0%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 4333713
100.0%
ValueCountFrequency (%)
(unknown) 130011
100.0%

loyalty_program
Text

              Full Dataset  Stratified Sample
Distinct      2             2
Distinct (%)  < 0.1%        < 0.1%
Missing       0             0
Missing (%)   0.0%          0.0%
Memory size   7.6 MiB       234.5 KiB

Length

               Full Dataset  Stratified Sample
Max length     3             3
Median length  2             2
Mean length    2.499712      2.499666667
Min length     2             2

Characters and Unicode

                     Full Dataset  Stratified Sample
Total characters     2499712       74990
Distinct characters  5             5
Distinct categories  1             1
Distinct scripts     1             1
Distinct blocks      1             1

The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

            Full Dataset  Stratified Sample
Unique      0             0
Unique (%)  0.0%          0.0%

Sample

         Full Dataset  Stratified Sample
1st row  No            No
2nd row  No            No
3rd row  No            Yes
4th row  No            No
5th row  Yes           No

Full Dataset                   Stratified Sample
Value  Count   Frequency (%)   Value  Count  Frequency (%)
no     500288  50.0%           no     15010  50.0%
yes    499712  50.0%           yes    14990  50.0%

Most occurring characters

Full Dataset                   Stratified Sample
Value  Count   Frequency (%)   Value  Count  Frequency (%)
N      500288  20.0%           N      15010  20.0%
o      500288  20.0%           o      15010  20.0%
Y      499712  20.0%           Y      14990  20.0%
e      499712  20.0%           e      14990  20.0%
s      499712  20.0%           s      14990  20.0%

Most occurring categories

ValueCountFrequency (%)
(unknown) 2499712
100.0%
ValueCountFrequency (%)
(unknown) 74990
100.0%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 2499712
100.0%
ValueCountFrequency (%)
(unknown) 74990
100.0%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 2499712
100.0%
ValueCountFrequency (%)
(unknown) 74990
100.0%

membership_years
Real number (ℝ)

              Full Dataset  Stratified Sample
Distinct      10            10
Distinct (%)  < 0.1%        < 0.1%
Missing       0             0
Missing (%)   0.0%          0.0%
Infinite      0             0
Infinite (%)  0.0%          0.0%
Mean          4.497453      4.512466667
Minimum       0             0
Maximum       9             9
Zeros         99846         3030
Zeros (%)     10.0%         10.1%
Negative      0             0
Negative (%)  0.0%          0.0%
Memory size   7.6 MiB       234.5 KiB

Quantile statistics

                           Full Dataset  Stratified Sample
Minimum                    0             0
5-th percentile            0             0
Q1                         2             2
median                     4             5
Q3                         7             7
95-th percentile           9             9
Maximum                    9             9
Range                      9             9
Interquartile range (IQR)  5             5

Descriptive statistics

                                 Full Dataset    Stratified Sample
Standard deviation               2.872405571     2.876847891
Coefficient of variation (CV)    0.6386738385    0.637533328
Kurtosis                         -1.22454665     -1.222483107
Mean                             4.497453        4.512466667
Median Absolute Deviation (MAD)  3               2
Skewness                         0.001590463324  -0.009342021881
Sum                              4497453         135374
Variance                         8.250713764     8.276253791
Monotonicity                     Not monotonic   Not monotonic
Histogram with fixed size bins (bins=10)
Full Dataset                   Stratified Sample
Value  Count   Frequency (%)   Value  Count  Frequency (%)
1      100686  10.1%           5      3096   10.3%
5      100183  10.0%           9      3060   10.2%
4      100137  10.0%           0      3030   10.1%
9      99977   10.0%           6      3021   10.1%
2      99964   10.0%           7      3020   10.1%
8      99891   10.0%           3      3008   10.0%
6      99865   10.0%           1      2998   10.0%
0      99846   10.0%           8      2951   9.8%
7      99728   10.0%           4      2913   9.7%
3      99723   10.0%           2      2903   9.7%

Full Dataset                   Stratified Sample
Value  Count   Frequency (%)   Value  Count  Frequency (%)
0      99846   10.0%           0      3030   10.1%
1      100686  10.1%           1      2998   10.0%
2      99964   10.0%           2      2903   9.7%
3      99723   10.0%           3      3008   10.0%
4      100137  10.0%           4      2913   9.7%

churned
Text

              Full Dataset  Stratified Sample
Distinct      2             2
Distinct (%)  < 0.1%        < 0.1%
Missing       0             0
Missing (%)   0.0%          0.0%
Memory size   7.6 MiB       234.5 KiB

Length

               Full Dataset  Stratified Sample
Max length     3             3
Median length  2             2
Mean length    2.499729      2.496266667
Min length     2             2

Characters and Unicode

                     Full Dataset  Stratified Sample
Total characters     2499729       74888
Distinct characters  5             5
Distinct categories  1             1
Distinct scripts     1             1
Distinct blocks      1             1

The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

            Full Dataset  Stratified Sample
Unique      0             0
Unique (%)  0.0%          0.0%

Sample

         Full Dataset  Stratified Sample
1st row  No            Yes
2nd row  No            Yes
3rd row  No            No
4th row  No            No
5th row  Yes           Yes

Full Dataset                   Stratified Sample
Value  Count   Frequency (%)   Value  Count  Frequency (%)
no     500271  50.0%           no     15112  50.4%
yes    499729  50.0%           yes    14888  49.6%

Most occurring characters

Full Dataset                   Stratified Sample
Value  Count   Frequency (%)   Value  Count  Frequency (%)
N      500271  20.0%           N      15112  20.2%
o      500271  20.0%           o      15112  20.2%
Y      499729  20.0%           Y      14888  19.9%
e      499729  20.0%           e      14888  19.9%
s      499729  20.0%           s      14888  19.9%

Most occurring categories

ValueCountFrequency (%)
(unknown) 2499729
100.0%
ValueCountFrequency (%)
(unknown) 74888
100.0%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 2499729
100.0%
ValueCountFrequency (%)
(unknown) 74888
100.0%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 2499729
100.0%
ValueCountFrequency (%)
(unknown) 74888
100.0%

marital_status
Text

              Full Dataset  Stratified Sample
Distinct      3             3
Distinct (%)  < 0.1%        < 0.1%
Missing       0             0
Missing (%)   0.0%          0.0%
Memory size   7.6 MiB       234.5 KiB

Length

               Full Dataset  Stratified Sample
Max length     8             8
Median length  7             7
Mean length    7.000866      7.001233333
Min length     6             6

Characters and Unicode

                     Full Dataset  Stratified Sample
Total characters     7000866       210037
Distinct characters  14            14
Distinct categories  1             1
Distinct scripts     1             1
Distinct blocks      1             1

The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

            Full Dataset  Stratified Sample
Unique      0             0
Unique (%)  0.0%          0.0%

Sample

         Full Dataset  Stratified Sample
1st row  Divorced      Divorced
2nd row  Married       Divorced
3rd row  Married       Divorced
4th row  Divorced      Married
5th row  Divorced      Married

Full Dataset                      Stratified Sample
Value     Count   Frequency (%)   Value     Count  Frequency (%)
divorced  333816  33.4%           married   10067  33.6%
married   333234  33.3%           divorced  9985   33.3%
single    332950  33.3%           single    9948   33.2%

Most occurring characters

Full Dataset                               Stratified Sample
Value             Count    Frequency (%)   Value             Count  Frequency (%)
r                 1000284  14.3%           r                 30119  14.3%
i                 1000000  14.3%           i                 30000  14.3%
e                 1000000  14.3%           e                 30000  14.3%
d                 667050   9.5%            d                 20052  9.5%
D                 333816   4.8%            a                 10067  4.8%
v                 333816   4.8%            M                 10067  4.8%
c                 333816   4.8%            D                 9985   4.8%
o                 333816   4.8%            v                 9985   4.8%
M                 333234   4.8%            o                 9985   4.8%
a                 333234   4.8%            c                 9985   4.8%
Other values (4)  1331800  19.0%           Other values (4)  39792  18.9%

Most occurring categories

ValueCountFrequency (%)
(unknown) 7000866
100.0%
ValueCountFrequency (%)
(unknown) 210037
100.0%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 7000866
100.0%
ValueCountFrequency (%)
(unknown) 210037
100.0%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 7000866
100.0%
ValueCountFrequency (%)
(unknown) 210037
100.0%

number_of_children
Real number (ℝ)

              Full Dataset  Stratified Sample
Distinct      5             5
Distinct (%)  < 0.1%        < 0.1%
Missing       0             0
Missing (%)   0.0%          0.0%
Infinite      0             0
Infinite (%)  0.0%          0.0%
Mean          2.000554      1.998133333
Minimum       0             0
Maximum       4             4
Zeros         199753        5930
Zeros (%)     20.0%         19.8%
Negative      0             0
Negative (%)  0.0%          0.0%
Memory size   7.6 MiB       234.5 KiB

Quantile statistics

                           Full Dataset  Stratified Sample
Minimum                    0             0
5-th percentile            0             0
Q1                         1             1
median                     2             2
Q3                         3             3
95-th percentile           4             4
Maximum                    4             4
Range                      4             4
Interquartile range (IQR)  2             2

Descriptive statistics

                                 Full Dataset      Stratified Sample
Standard deviation               1.414214161       1.407644332
Coefficient of variation (CV)    0.7069112661      0.7044796802
Kurtosis                         -1.300270709      -1.290005138
Mean                             2.000554          1.998133333
Median Absolute Deviation (MAD)  1                 1
Skewness                         -0.0001223295646  0.001157855148
Sum                              2000554           59944
Variance                         2.000001693       1.981462564
Monotonicity                     Not monotonic     Not monotonic
Histogram with fixed size bins (bins=5)
Full Dataset                   Stratified Sample
Value  Count   Frequency (%)   Value  Count  Frequency (%)
1      200307  20.0%           3      6063   20.2%
4      200157  20.0%           1      6059   20.2%
3      200053  20.0%           2      6048   20.2%
0      199753  20.0%           0      5930   19.8%
2      199730  20.0%           4      5900   19.7%

Full Dataset                   Stratified Sample
Value  Count   Frequency (%)   Value  Count  Frequency (%)
0      199753  20.0%           0      5930   19.8%
1      200307  20.0%           1      6059   20.2%
2      199730  20.0%           2      6048   20.2%
3      200053  20.0%           3      6063   20.2%
4      200157  20.0%           4      5900   19.7%

education_level
Text

              Full Dataset  Stratified Sample
Distinct      4             4
Distinct (%)  < 0.1%        < 0.1%
Missing       0             0
Missing (%)   0.0%          0.0%
Memory size   7.6 MiB       234.5 KiB

Length

               Full Dataset  Stratified Sample
Max length     11            11
Median length  10            10
Mean length    8.00064       8.017833333
Min length     3             3

Characters and Unicode

                     Full Dataset  Stratified Sample
Total characters     8000640       240535
Distinct characters  19            19
Distinct categories  1             1
Distinct scripts     1             1
Distinct blocks      1             1

The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

            Full Dataset  Stratified Sample
Unique      0             0
Unique (%)  0.0%          0.0%

Sample

         Full Dataset  Stratified Sample
1st row  Bachelor's    Bachelor's
2nd row  PhD           High School
3rd row  Bachelor's    PhD
4th row  Master's      Bachelor's
5th row  Bachelor's    Master's

Full Dataset                        Stratified Sample
Value       Count   Frequency (%)   Value       Count  Frequency (%)
bachelor's  250360  20.0%           high        7575   20.2%
high        250105  20.0%           school      7575   20.2%
school      250105  20.0%           bachelor's  7565   20.1%
phd         250079  20.0%           phd         7464   19.9%
master's    249456  20.0%           master's    7396   19.7%

Most occurring characters

Full Dataset                               Stratified Sample
Value             Count    Frequency (%)   Value             Count  Frequency (%)
h                 1000649  12.5%           h                 30179  12.5%
o                 750570   9.4%            o                 22715  9.4%
s                 749272   9.4%            s                 22357  9.3%
c                 500465   6.3%            c                 15140  6.3%
l                 500465   6.3%            l                 15140  6.3%
e                 499816   6.2%            e                 14961  6.2%
a                 499816   6.2%            a                 14961  6.2%
'                 499816   6.2%            '                 14961  6.2%
r                 499816   6.2%            r                 14961  6.2%
B                 250360   3.1%            H                 7575   3.1%
Other values (9)  2249595  28.1%           Other values (9)  67585  28.1%

Most occurring categories

ValueCountFrequency (%)
(unknown) 8000640
100.0%
ValueCountFrequency (%)
(unknown) 240535
100.0%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 8000640
100.0%
ValueCountFrequency (%)
(unknown) 240535
100.0%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 8000640
100.0%
ValueCountFrequency (%)
(unknown) 240535
100.0%

occupation
Text

              Full Dataset  Stratified Sample
Distinct      4             4
Distinct (%)  < 0.1%        < 0.1%
Missing       0             0
Missing (%)   0.0%          0.0%
Memory size   7.6 MiB       234.5 KiB

Length

               Full Dataset  Stratified Sample
Max length     13            13
Median length  10            10
Mean length    9.500854      9.497766667
Min length     7             7

Characters and Unicode

                     Full Dataset  Stratified Sample
Total characters     9500854       284933
Distinct characters  17            17
Distinct categories  1             1
Distinct scripts     1             1
Distinct blocks      1             1

The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

            Full Dataset  Stratified Sample
Unique      0             0
Unique (%)  0.0%          0.0%

Sample

         Full Dataset   Stratified Sample
1st row  Self-Employed  Retired
2nd row  Unemployed     Retired
3rd row  Self-Employed  Employed
4th row  Employed       Self-Employed
5th row  Employed       Employed

Full Dataset                           Stratified Sample
Value          Count   Frequency (%)   Value          Count  Frequency (%)
employed       250857  25.1%           retired        7545   25.1%
unemployed     250117  25.0%           self-employed  7534   25.1%
self-employed  249941  25.0%           employed       7517   25.1%
retired        249085  24.9%           unemployed     7404   24.7%

Most occurring characters

Full Dataset                               Stratified Sample
Value             Count    Frequency (%)   Value             Count  Frequency (%)
e                 1749143  18.4%           e                 52483  18.4%
l                 1000856  10.5%           d                 30000  10.5%
d                 1000000  10.5%           l                 29989  10.5%
o                 750915   7.9%            o                 22455  7.9%
m                 750915   7.9%            p                 22455  7.9%
y                 750915   7.9%            m                 22455  7.9%
p                 750915   7.9%            y                 22455  7.9%
E                 500798   5.3%            E                 15051  5.3%
U                 250117   2.6%            R                 7545   2.6%
n                 250117   2.6%            t                 7545   2.6%
Other values (7)  1746163  18.4%           Other values (7)  52500  18.4%

Most occurring categories

ValueCountFrequency (%)
(unknown) 9500854
100.0%
ValueCountFrequency (%)
(unknown) 284933
100.0%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 9500854
100.0%
ValueCountFrequency (%)
(unknown) 284933
100.0%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 9500854
100.0%
ValueCountFrequency (%)
(unknown) 284933
100.0%

transaction_id
Real number (ℝ)

              Full Dataset  Stratified Sample
Distinct      632576        29538
Distinct (%)  63.3%         98.5%
Missing       0             0
Missing (%)   0.0%          0.0%
Infinite      0             0
Infinite (%)  0.0%          0.0%
Mean          499891.7314   502817.3992
Minimum       2             30
Maximum       999999        999881
Zeros         0             0
Zeros (%)     0.0%          0.0%
Negative      0             0
Negative (%)  0.0%          0.0%
Memory size   7.6 MiB       234.5 KiB

Quantile statistics

                           Full Dataset  Stratified Sample
Minimum                    2             30
5-th percentile            50200.95      51972.95
Q1                         249878.75     255453.25
median                     499559.5      502048.5
Q3                         750071.25     751640.75
95-th percentile           950045.2      950411.5
Maximum                    999999        999881
Range                      999997        999851
Interquartile range (IQR)  500192.5      496187.5

Descriptive statistics

                                 Full Dataset         Stratified Sample
Standard deviation               288706.0577          287335.4613
Coefficient of variation (CV)    0.5775371735         0.5714509119
Kurtosis                         -1.200114605         -1.184605739
Mean                             499891.7314          502817.3992
Median Absolute Deviation (MAD)  250088.5             248150.5
Skewness                         0.002395187253       -0.009883747709
Sum                              4.998917314 × 10^11  1.508452198 × 10^10
Variance                         8.335118772 × 10^10  8.256166732 × 10^10
Monotonicity                     Not monotonic        Not monotonic
Histogram with fixed size bins (bins=50)
Full Dataset                                   Stratified Sample
Value                  Count   Frequency (%)   Value                 Count  Frequency (%)
115913                 9       < 0.1%          491609                3      < 0.1%
504562                 8       < 0.1%          6554                  3      < 0.1%
344167                 8       < 0.1%          318501                3      < 0.1%
2773                   8       < 0.1%          346938                3      < 0.1%
239407                 8       < 0.1%          832355                3      < 0.1%
620816                 8       < 0.1%          974493                2      < 0.1%
273197                 8       < 0.1%          233193                2      < 0.1%
254678                 7       < 0.1%          108085                2      < 0.1%
798940                 7       < 0.1%          496964                2      < 0.1%
335691                 7       < 0.1%          616078                2      < 0.1%
Other values (632566)  999922  > 99.9%         Other values (29528)  29975  99.9%

Full Dataset                   Stratified Sample
Value  Count  Frequency (%)    Value  Count  Frequency (%)
2      2      < 0.1%           30     1      < 0.1%
3      1      < 0.1%           36     1      < 0.1%
5      3      < 0.1%           55     1      < 0.1%
6      1      < 0.1%           63     1      < 0.1%
7      2      < 0.1%           69     1      < 0.1%

transaction_date
Text

Statistic | Full Dataset | Stratified Sample
Distinct | 992231 | 29990
Distinct (%) | 99.2% | > 99.9%
Missing | 0 | 0
Missing (%) | 0.0% | 0.0%
Memory size | 7.6 MiB | 234.5 KiB

Length

Statistic | Full Dataset | Stratified Sample
Max length | 19 | 19
Median length | 19 | 19
Mean length | 19 | 19
Min length | 19 | 19

Characters and Unicode

Statistic | Full Dataset | Stratified Sample
Total characters | 19000000 | 570000
Distinct characters | 13 | 13
Distinct categories | 1 | 1
Distinct scripts | 1 | 1
Distinct blocks | 1 | 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
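
As a minimal sketch of what such character properties look like, Python's standard `unicodedata` module exposes the general category of each code point. The example profiles a single timestamp string in the format used by this variable. The helper name `category_counts` is illustrative, and this is not necessarily how the report computes its own figures.

```python
import unicodedata
from collections import Counter

def category_counts(text: str) -> Counter:
    """Count the characters of `text` by Unicode general category (e.g. Nd, Pd)."""
    return Counter(unicodedata.category(ch) for ch in text)

sample = "2020-10-11 10:08:52"
counts = category_counts(sample)
for category, count in sorted(counts.items()):
    print(category, count)

# Number of different characters used in the string.
print(len(set(sample)))
```

For this string the categories are decimal digits (Nd), dash punctuation (Pd), other punctuation (Po) and space separators (Zs). The 13 distinct characters reported for the whole column are consistent with the ten digits plus "-", ":" and the space.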

Unique

Statistic | Full Dataset | Stratified Sample
Unique | 984504 | 29980
Unique (%) | 98.5% | 99.9%
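
The Distinct and Unique rows measure different things: Distinct counts how many different values occur, while Unique counts only the values that occur exactly once. That is why Unique is the smaller of the two here. A minimal sketch on a small hand-built list (the values and the function name are illustrative, not taken from the dataset):

```python
from collections import Counter

def distinct_and_unique(values):
    """Return (number of different values, number of values seen exactly once)."""
    counts = Counter(values)
    n_distinct = len(counts)
    n_unique = sum(1 for c in counts.values() if c == 1)
    return n_distinct, n_unique

timestamps = [
    "2020-01-01 00:00:01",
    "2020-01-01 00:00:02",
    "2020-01-01 00:00:02",  # repeated value: counted as distinct, but not as unique
    "2020-01-01 00:00:03",
]
n_distinct, n_unique = distinct_and_unique(timestamps)
print(n_distinct, n_unique)
```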

Sample

Row | Full Dataset | Stratified Sample
1st row | 2020-10-11 10:08:52 | 2021-01-28 14:56:18
2nd row | 2021-12-08 01:07:40 | 2021-08-05 22:27:14
3rd row | 2020-02-17 09:40:48 | 2020-11-29 01:27:56
4th row | 2020-08-13 00:43:14 | 2020-10-20 00:00:36
5th row | 2021-07-02 11:59:03 | 2021-01-01 15:53:55
ValueCountFrequency (%)
2020-10-05 1509
 
0.1%
2020-09-06 1467
 
0.1%
2020-10-04 1464
 
0.1%
2020-07-26 1463
 
0.1%
2020-02-26 1458
 
0.1%
2020-05-03 1455
 
0.1%
2021-02-27 1453
 
0.1%
2021-07-30 1451
 
0.1%
2020-09-07 1451
 
0.1%
2020-10-09 1447
 
0.1%
Other values (87119) 1985382
99.3%
ValueCountFrequency (%)
2021-08-13 61
 
0.1%
2020-10-17 60
 
0.1%
2020-11-27 60
 
0.1%
2021-11-09 60
 
0.1%
2021-11-24 59
 
0.1%
2021-06-25 57
 
0.1%
2021-08-24 57
 
0.1%
2020-09-11 56
 
0.1%
2021-04-23 56
 
0.1%
2021-12-15 56
 
0.1%
Other values (26058) 59418
99.0%

Most occurring characters

ValueCountFrequency (%)
0 3798683
20.0%
2 3414072
18.0%
1 2439659
12.8%
: 2000000
10.5%
- 2000000
10.5%
1000000
 
5.3%
3 890067
 
4.7%
5 800311
 
4.2%
4 798703
 
4.2%
7 467556
 
2.5%
Other values (3) 1390949
 
7.3%
ValueCountFrequency (%)
0 113957
20.0%
2 102327
18.0%
1 73132
12.8%
- 60000
10.5%
: 60000
10.5%
30000
 
5.3%
3 26926
 
4.7%
5 23976
 
4.2%
4 23960
 
4.2%
7 14065
 
2.5%
Other values (3) 41657
 
7.3%

Most occurring categories, scripts and blocks

Group | Full Dataset | Stratified Sample
(unknown) | 19000000 (100.0%) | 570000 (100.0%)

Every character falls into a single (unknown) category, script and block, so the "Most frequent character per category", "per script" and "per block" tables are identical to the "Most occurring characters" table above.

product_id
Real number (ℝ)

Statistic | Full Dataset | Stratified Sample
Distinct | 9999 | 9507
Distinct (%) | 1.0% | 31.7%
Missing | 0 | 0
Missing (%) | 0.0% | 0.0%
Infinite | 0 | 0
Infinite (%) | 0.0% | 0.0%
Mean | 4999.564515 | 5014.683167
Minimum | 1 | 1
Maximum | 9999 | 9999
Zeros | 0 | 0
Zeros (%) | 0.0% | 0.0%
Negative | 0 | 0
Negative (%) | 0.0% | 0.0%
Memory size | 7.6 MiB | 234.5 KiB

Quantile statistics

Statistic | Full Dataset | Stratified Sample
Minimum | 1 | 1
5-th percentile | 500 | 510
Q1 | 2498 | 2528
median | 4999 | 5026.5
Q3 | 7498 | 7510.25
95-th percentile | 9499 | 9490.05
Maximum | 9999 | 9999
Range | 9998 | 9998
Interquartile range (IQR) | 5000 | 4982.25

Descriptive statistics

Statistic | Full Dataset | Stratified Sample
Standard deviation | 2886.798391 | 2882.480717
Coefficient of variation (CV) | 0.5774099689 | 0.5748081426
Kurtosis | -1.200144352 | -1.197819312
Mean | 4999.564515 | 5014.683167
Median Absolute Deviation (MAD) | 2500 | 2490.5
Skewness | 0.0002346107222 | -0.00782393366
Sum | 4999564515 | 150440495
Variance | 8333604.95 | 8308695.082
Monotonicity | Not monotonic | Not monotonic

Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
4898 145
 
< 0.1%
51 143
 
< 0.1%
9593 141
 
< 0.1%
5427 138
 
< 0.1%
3923 137
 
< 0.1%
8365 135
 
< 0.1%
4541 134
 
< 0.1%
2590 134
 
< 0.1%
467 133
 
< 0.1%
3676 133
 
< 0.1%
Other values (9989) 998627
99.9%
ValueCountFrequency (%)
3589 10
 
< 0.1%
1491 10
 
< 0.1%
2635 10
 
< 0.1%
4705 10
 
< 0.1%
6613 10
 
< 0.1%
5665 10
 
< 0.1%
6931 9
 
< 0.1%
5435 9
 
< 0.1%
278 9
 
< 0.1%
5777 9
 
< 0.1%
Other values (9497) 29904
99.7%
ValueCountFrequency (%)
1 92
< 0.1%
2 107
< 0.1%
3 117
< 0.1%
4 97
< 0.1%
5 92
< 0.1%
ValueCountFrequency (%)
1 3
< 0.1%
2 4
< 0.1%
3 3
< 0.1%
4 3
< 0.1%
5 3
< 0.1%

product_category
Text

Statistic | Full Dataset | Stratified Sample
Distinct | 5 | 5
Distinct (%) | < 0.1% | < 0.1%
Missing | 0 | 0
Missing (%) | 0.0% | 0.0%
Memory size | 7.6 MiB | 234.5 KiB

Length

Statistic | Full Dataset | Stratified Sample
Max length | 11 | 11
Median length | 9 | 9
Mean length | 8.196389 | 8.208833333
Min length | 4 | 4

Characters and Unicode

Statistic | Full Dataset | Stratified Sample
Total characters | 8196389 | 246265
Distinct characters | 18 | 18
Distinct categories | 1 | 1
Distinct scripts | 1 | 1
Distinct blocks | 1 | 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Statistic | Full Dataset | Stratified Sample
Unique | 0 | 0
Unique (%) | 0.0% | 0.0%

Sample

Row | Full Dataset | Stratified Sample
1st row | Electronics | Clothing
2nd row | Groceries | Electronics
3rd row | Toys | Groceries
4th row | Toys | Furniture
5th row | Clothing | Furniture
ValueCountFrequency (%)
toys 200669
20.1%
groceries 200214
20.0%
clothing 199778
20.0%
electronics 199756
20.0%
furniture 199583
20.0%
ValueCountFrequency (%)
groceries 6036
20.1%
furniture 6016
20.1%
clothing 6010
20.0%
electronics 5995
20.0%
toys 5943
19.8%

Most occurring characters

ValueCountFrequency (%)
r 999350
12.2%
o 800417
9.8%
e 799767
9.8%
i 799331
9.8%
s 600639
 
7.3%
c 599726
 
7.3%
n 599117
 
7.3%
t 599117
 
7.3%
l 399534
 
4.9%
u 399166
 
4.9%
Other values (8) 1600225
19.5%
ValueCountFrequency (%)
r 30099
12.2%
e 24083
9.8%
i 24057
9.8%
o 23984
9.7%
c 18026
 
7.3%
n 18021
 
7.3%
t 18021
 
7.3%
s 17974
 
7.3%
u 12032
 
4.9%
l 12005
 
4.9%
Other values (8) 47963
19.5%

Most occurring categories, scripts and blocks

Group | Full Dataset | Stratified Sample
(unknown) | 8196389 (100.0%) | 246265 (100.0%)

Every character falls into a single (unknown) category, script and block, so the "Most frequent character per category", "per script" and "per block" tables are identical to the "Most occurring characters" table above.

quantity
Real number (ℝ)

Statistic | Full Dataset | Stratified Sample
Distinct | 9 | 9
Distinct (%) | < 0.1% | < 0.1%
Missing | 0 | 0
Missing (%) | 0.0% | 0.0%
Infinite | 0 | 0
Infinite (%) | 0.0% | 0.0%
Mean | 5.002649 | 5.016133333
Minimum | 1 | 1
Maximum | 9 | 9
Zeros | 0 | 0
Zeros (%) | 0.0% | 0.0%
Negative | 0 | 0
Negative (%) | 0.0% | 0.0%
Memory size | 7.6 MiB | 234.5 KiB

Quantile statistics

Statistic | Full Dataset | Stratified Sample
Minimum | 1 | 1
5-th percentile | 1 | 1
Q1 | 3 | 3
median | 5 | 5
Q3 | 7 | 7
95-th percentile | 9 | 9
Maximum | 9 | 9
Range | 8 | 8
Interquartile range (IQR) | 4 | 4

Descriptive statistics

Statistic | Full Dataset | Stratified Sample
Standard deviation | 2.583751276 | 2.584742878
Coefficient of variation (CV) | 0.516476626 | 0.5152859197
Kurtosis | -1.231080652 | -1.230965354
Mean | 5.002649 | 5.016133333
Median Absolute Deviation (MAD) | 2 | 2
Skewness | -0.0003647460673 | -0.007367687857
Sum | 5002649 | 150484
Variance | 6.675770659 | 6.680895745
Monotonicity | Not monotonic | Not monotonic

Histogram with fixed size bins (bins=9)
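
A fixed-size-bin histogram splits the interval from the minimum to the maximum into equally wide bins and counts the values that fall in each. The pure-Python sketch below assumes the last bin is closed on the right so that the maximum is counted. The function name and the input list are illustrative, and the plotting library used by the report may differ in detail.

```python
def fixed_width_histogram(values, bins):
    """Count `values` into `bins` equally wide bins spanning [min, max]."""
    lo, hi = min(values), max(values)
    if lo == hi:
        # Degenerate case: all values identical, so everything goes in the first bin.
        return [len(values)] + [0] * (bins - 1)
    width = (hi - lo) / bins
    counts = [0] * bins
    for v in values:
        # Clamp so the maximum lands in the last bin instead of overflowing.
        index = min(int((v - lo) / width), bins - 1)
        counts[index] += 1
    return counts

# With integer quantities between 1 and 9 and bins=9, each value gets its own bin.
quantities = [1, 2, 2, 3, 5, 5, 5, 9]
print(fixed_width_histogram(quantities, bins=9))
```

Because quantity takes only the nine integer values 1 to 9, choosing bins=9 gives one bar per value, which is presumably why this variable uses 9 bins rather than 50.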
ValueCountFrequency (%)
9 111914
11.2%
3 111422
11.1%
7 111274
11.1%
1 111150
11.1%
4 111104
11.1%
6 111098
11.1%
2 110782
11.1%
8 110747
11.1%
5 110509
11.1%
ValueCountFrequency (%)
9 3383
11.3%
7 3364
11.2%
8 3354
11.2%
4 3344
11.1%
5 3342
11.1%
1 3327
11.1%
3 3316
11.1%
6 3289
11.0%
2 3281
10.9%
ValueCountFrequency (%)
1 111150
11.1%
2 110782
11.1%
3 111422
11.1%
4 111104
11.1%
5 110509
11.1%
ValueCountFrequency (%)
1 3327
11.1%
2 3281
10.9%
3 3316
11.1%
4 3344
11.1%
5 3342
11.1%

unit_price
Real number (ℝ)

Statistic | Full Dataset | Stratified Sample
Distinct | 99896 | 25919
Distinct (%) | 10.0% | 86.4%
Missing | 0 | 0
Missing (%) | 0.0% | 0.0%
Infinite | 0 | 0
Infinite (%) | 0.0% | 0.0%
Mean | 500.2613169 | 499.671594
Minimum | 1 | 1
Maximum | 1000 | 999.96
Zeros | 0 | 0
Zeros (%) | 0.0% | 0.0%
Negative | 0 | 0
Negative (%) | 0.0% | 0.0%
Memory size | 7.6 MiB | 234.5 KiB

Quantile statistics

Statistic | Full Dataset | Stratified Sample
Minimum | 1 | 1
5-th percentile | 50.72 | 50.889
Q1 | 250.31 | 248.755
median | 500.41 | 498.76
Q3 | 750.16 | 751.6825
95-th percentile | 949.91 | 949.4015
Maximum | 1000 | 999.96
Range | 999 | 998.96
Interquartile range (IQR) | 499.85 | 502.9275

Descriptive statistics

Statistic | Full Dataset | Stratified Sample
Standard deviation | 288.4628596 | 288.6536702
Coefficient of variation (CV) | 0.5766243559 | 0.5776867719
Kurtosis | -1.20144233 | -1.198680323
Mean | 500.2613169 | 499.671594
Median Absolute Deviation (MAD) | 249.93 | 251.41
Skewness | -1.097330655 × 10^-5 | 0.00333989629
Sum | 500261316.9 | 14990147.82
Variance | 83210.82139 | 83320.94129
Monotonicity | Not monotonic | Not monotonic

Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
226.51 28
 
< 0.1%
450.02 26
 
< 0.1%
591.8 25
 
< 0.1%
921.47 25
 
< 0.1%
354.83 25
 
< 0.1%
49.69 25
 
< 0.1%
111.41 24
 
< 0.1%
954.1 24
 
< 0.1%
619.19 24
 
< 0.1%
845.21 24
 
< 0.1%
Other values (99886) 999750
> 99.9%
ValueCountFrequency (%)
998.99 5
 
< 0.1%
640.48 5
 
< 0.1%
697.24 4
 
< 0.1%
530.24 4
 
< 0.1%
742.95 4
 
< 0.1%
855.69 4
 
< 0.1%
729.07 4
 
< 0.1%
765.64 4
 
< 0.1%
119.68 4
 
< 0.1%
720.62 4
 
< 0.1%
Other values (25909) 29958
99.9%
ValueCountFrequency (%)
1 7
< 0.1%
1.01 9
< 0.1%
1.02 11
< 0.1%
1.03 8
< 0.1%
1.04 17
< 0.1%
ValueCountFrequency (%)
1 1
< 0.1%
1.03 1
< 0.1%
1.04 1
< 0.1%
1.06 1
< 0.1%
1.08 2
< 0.1%

discount_applied
Real number (ℝ)

Statistic | Full Dataset | Stratified Sample
Distinct | 51 | 51
Distinct (%) | < 0.1% | 0.2%
Missing | 0 | 0
Missing (%) | 0.0% | 0.0%
Infinite | 0 | 0
Infinite (%) | 0.0% | 0.0%
Mean | 0.24991049 | 0.2490266667
Minimum | 0 | 0
Maximum | 0.5 | 0.5
Zeros | 9967 | 280
Zeros (%) | 1.0% | 0.9%
Negative | 0 | 0
Negative (%) | 0.0% | 0.0%
Memory size | 7.6 MiB | 234.5 KiB

Quantile statistics

Statistic | Full Dataset | Stratified Sample
Minimum | 0 | 0
5-th percentile | 0.03 | 0.03
Q1 | 0.13 | 0.12
median | 0.25 | 0.25
Q3 | 0.37 | 0.37
95-th percentile | 0.47 | 0.47
Maximum | 0.5 | 0.5
Range | 0.5 | 0.5
Interquartile range (IQR) | 0.24 | 0.25

Descriptive statistics

Statistic | Full Dataset | Stratified Sample
Standard deviation | 0.1443279083 | 0.1439335605
Coefficient of variation (CV) | 0.5775184079 | 0.5779845286
Kurtosis | -1.19713108 | -1.195219351
Mean | 0.24991049 | 0.2490266667
Median Absolute Deviation (MAD) | 0.12 | 0.12
Skewness | 0.0002640976336 | 0.01064049997
Sum | 249910.49 | 7470.8
Variance | 0.02083054512 | 0.02071686985
Monotonicity | Not monotonic | Not monotonic

Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0.19 20302
 
2.0%
0.06 20213
 
2.0%
0.34 20211
 
2.0%
0.03 20207
 
2.0%
0.05 20207
 
2.0%
0.21 20199
 
2.0%
0.29 20155
 
2.0%
0.07 20153
 
2.0%
0.43 20145
 
2.0%
0.18 20111
 
2.0%
Other values (41) 798097
79.8%
ValueCountFrequency (%)
0.07 658
 
2.2%
0.4 658
 
2.2%
0.12 645
 
2.1%
0.39 632
 
2.1%
0.03 631
 
2.1%
0.37 628
 
2.1%
0.13 628
 
2.1%
0.24 627
 
2.1%
0.15 626
 
2.1%
0.19 625
 
2.1%
Other values (41) 23642
78.8%
ValueCountFrequency (%)
0 9967
1.0%
0.01 20018
2.0%
0.02 19788
2.0%
0.03 20207
2.0%
0.04 19947
2.0%
ValueCountFrequency (%)
0 280
0.9%
0.01 570
1.9%
0.02 613
2.0%
0.03 631
2.1%
0.04 594
2.0%

payment_method
Text

Statistic | Full Dataset | Stratified Sample
Distinct | 4 | 4
Distinct (%) | < 0.1% | < 0.1%
Missing | 0 | 0
Missing (%) | 0.0% | 0.0%
Memory size | 7.6 MiB | 234.5 KiB

Length

Statistic | Full Dataset | Stratified Sample
Max length | 14 | 14
Median length | 11 | 11
Mean length | 9.751935 | 9.7828
Min length | 4 | 4

Characters and Unicode

Statistic | Full Dataset | Stratified Sample
Total characters | 9751935 | 293484
Distinct characters | 19 | 19
Distinct categories | 1 | 1
Distinct scripts | 1 | 1
Distinct blocks | 1 | 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Statistic | Full Dataset | Stratified Sample
Unique | 0 | 0
Unique (%) | 0.0% | 0.0%

Sample

Row | Full Dataset | Stratified Sample
1st row | Credit Card | Debit Card
2nd row | Credit Card | Debit Card
3rd row | Debit Card | Debit Card
4th row | Credit Card | Debit Card
5th row | Mobile Payment | Debit Card
ValueCountFrequency (%)
card 500200
28.6%
credit 250435
14.3%
mobile 250030
14.3%
payment 250030
14.3%
cash 249770
14.3%
debit 249765
14.3%
ValueCountFrequency (%)
card 14999
28.5%
debit 7619
14.5%
mobile 7611
14.5%
payment 7611
14.5%
cash 7390
14.0%
credit 7380
14.0%
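
The word tables above appear to be built by lower-casing each value and splitting it on whitespace, which would explain why "card" accounts for 28.6% of all words in the Full Dataset: it occurs in both Credit Card and Debit Card. The sketch below works under that assumption, using per-value counts inferred from the Full Dataset word counts. The tokenisation rule is an inference from the table, not something taken from the library's documentation.

```python
from collections import Counter

# Per-value counts inferred from the Full Dataset word table
# (for example, "credit" only occurs in "Credit Card").
value_counts = {
    "Credit Card": 250_435,
    "Debit Card": 249_765,
    "Mobile Payment": 250_030,
    "Cash": 249_770,
}

# Lower-case each value, split it on whitespace, and add up the word counts.
word_counts = Counter()
for value, count in value_counts.items():
    for word in value.lower().split():
        word_counts[word] += count

total_words = sum(word_counts.values())
for word, count in word_counts.most_common():
    print(f"{word:<8} {count:>7} {100 * count / total_words:.1f}%")
```

Note that the percentages are shares of all words (1750230 here), not of the 1000000 rows, so they do not match the per-value shares of roughly 25% each.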

Most occurring characters

ValueCountFrequency (%)
C 1000405
10.3%
e 1000260
10.3%
a 1000000
10.3%
r 750635
 
7.7%
d 750635
 
7.7%
t 750230
 
7.7%
i 750230
 
7.7%
750230
 
7.7%
b 499795
 
5.1%
M 250030
 
2.6%
Other values (9) 2249485
23.1%
ValueCountFrequency (%)
e 30221
10.3%
a 30000
10.2%
C 29769
10.1%
22610
 
7.7%
i 22610
 
7.7%
t 22610
 
7.7%
r 22379
 
7.6%
d 22379
 
7.6%
b 15230
 
5.2%
D 7619
 
2.6%
Other values (9) 68057
23.2%

Most occurring categories, scripts and blocks

Group | Full Dataset | Stratified Sample
(unknown) | 9751935 (100.0%) | 293484 (100.0%)

Every character falls into a single (unknown) category, script and block, so the "Most frequent character per category", "per script" and "per block" tables are identical to the "Most occurring characters" table above.

store_location
Text

Statistic | Full Dataset | Stratified Sample
Distinct | 4 | 4
Distinct (%) | < 0.1% | < 0.1%
Missing | 0 | 0
Missing (%) | 0.0% | 0.0%
Memory size | 7.6 MiB | 234.5 KiB

Length

Statistic | Full Dataset | Stratified Sample
Max length | 10 | 10
Median length | 10 | 10
Mean length | 10 | 10
Min length | 10 | 10

Characters and Unicode

Statistic | Full Dataset | Stratified Sample
Total characters | 10000000 | 300000
Distinct characters | 12 | 12
Distinct categories | 1 | 1
Distinct scripts | 1 | 1
Distinct blocks | 1 | 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Statistic | Full Dataset | Stratified Sample
Unique | 0 | 0
Unique (%) | 0.0% | 0.0%

Sample

Row | Full Dataset | Stratified Sample
1st row | Location A | Location C
2nd row | Location C | Location A
3rd row | Location A | Location B
4th row | Location A | Location C
5th row | Location C | Location B
ValueCountFrequency (%)
location 1000000
50.0%
c 250336
 
12.5%
b 250280
 
12.5%
a 250150
 
12.5%
d 249234
 
12.5%
ValueCountFrequency (%)
location 30000
50.0%
c 7569
 
12.6%
a 7562
 
12.6%
b 7501
 
12.5%
d 7368
 
12.3%

Most occurring characters

ValueCountFrequency (%)
o 2000000
20.0%
L 1000000
10.0%
c 1000000
10.0%
a 1000000
10.0%
t 1000000
10.0%
i 1000000
10.0%
n 1000000
10.0%
1000000
10.0%
C 250336
 
2.5%
B 250280
 
2.5%
Other values (2) 499384
 
5.0%
ValueCountFrequency (%)
o 60000
20.0%
L 30000
10.0%
c 30000
10.0%
a 30000
10.0%
t 30000
10.0%
i 30000
10.0%
n 30000
10.0%
30000
10.0%
C 7569
 
2.5%
A 7562
 
2.5%
Other values (2) 14869
 
5.0%

Most occurring categories, scripts and blocks

Group | Full Dataset | Stratified Sample
(unknown) | 10000000 (100.0%) | 300000 (100.0%)

Every character falls into a single (unknown) category, script and block, so the "Most frequent character per category", "per script" and "per block" tables are identical to the "Most occurring characters" table above.

transaction_hour
Real number (ℝ)

Statistic | Full Dataset | Stratified Sample
Distinct | 24 | 24
Distinct (%) | < 0.1% | 0.1%
Missing | 0 | 0
Missing (%) | 0.0% | 0.0%
Infinite | 0 | 0
Infinite (%) | 0.0% | 0.0%
Mean | 11.505193 | 11.50666667
Minimum | 0 | 0
Maximum | 23 | 23
Zeros | 41756 | 1277
Zeros (%) | 4.2% | 4.3%
Negative | 0 | 0
Negative (%) | 0.0% | 0.0%
Memory size | 7.6 MiB | 234.5 KiB

Quantile statistics

Statistic | Full Dataset | Stratified Sample
Minimum | 0 | 0
5-th percentile | 1 | 1
Q1 | 5 | 6
median | 12 | 11
Q3 | 18 | 18
95-th percentile | 22 | 22
Maximum | 23 | 23
Range | 23 | 23
Interquartile range (IQR) | 13 | 12

Descriptive statistics

Statistic | Full Dataset | Stratified Sample
Standard deviation | 6.924459761 | 6.939118143
Coefficient of variation (CV) | 0.6018551589 | 0.6030519823
Kurtosis | -1.205305317 | -1.202551467
Mean | 11.505193 | 11.50666667
Median Absolute Deviation (MAD) | 6 | 6
Skewness | -0.001531297707 | -0.002854367692
Sum | 11505193 | 345200
Variance | 47.94814298 | 48.1513606
Monotonicity | Not monotonic | Not monotonic

Histogram with fixed size bins (bins=24)
ValueCountFrequency (%)
5 42166
 
4.2%
14 42161
 
4.2%
18 41872
 
4.2%
20 41812
 
4.2%
3 41780
 
4.2%
21 41778
 
4.2%
4 41756
 
4.2%
0 41756
 
4.2%
23 41750
 
4.2%
19 41707
 
4.2%
Other values (14) 581462
58.1%
ValueCountFrequency (%)
10 1292
 
4.3%
1 1287
 
4.3%
20 1286
 
4.3%
23 1283
 
4.3%
22 1280
 
4.3%
2 1279
 
4.3%
0 1277
 
4.3%
14 1274
 
4.2%
17 1258
 
4.2%
11 1257
 
4.2%
Other values (14) 17227
57.4%
ValueCountFrequency (%)
0 41756
4.2%
1 41637
4.2%
2 41388
4.1%
3 41780
4.2%
4 41756
4.2%
ValueCountFrequency (%)
0 1277
4.3%
1 1287
4.3%
2 1279
4.3%
3 1202
4.0%
4 1196
4.0%

day_of_week
Text

Statistic | Full Dataset | Stratified Sample
Distinct | 7 | 7
Distinct (%) | < 0.1% | < 0.1%
Missing | 0 | 0
Missing (%) | 0.0% | 0.0%
Memory size | 7.6 MiB | 234.5 KiB

Length

Statistic | Full Dataset | Stratified Sample
Max length | 9 | 9
Median length | 8 | 8
Mean length | 7.141075 | 7.146633333
Min length | 6 | 6

Characters and Unicode

Statistic | Full Dataset | Stratified Sample
Total characters | 7141075 | 214399
Distinct characters | 17 | 17
Distinct categories | 1 | 1
Distinct scripts | 1 | 1
Distinct blocks | 1 | 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Statistic | Full Dataset | Stratified Sample
Unique | 0 | 0
Unique (%) | 0.0% | 0.0%

Sample

Row | Full Dataset | Stratified Sample
1st row | Wednesday | Friday
2nd row | Friday | Saturday
3rd row | Saturday | Sunday
4th row | Friday | Sunday
5th row | Monday | Friday
ValueCountFrequency (%)
tuesday 143452
14.3%
friday 143067
14.3%
thursday 142930
14.3%
sunday 142875
14.3%
monday 142855
14.3%
saturday 142700
14.3%
wednesday 142121
14.2%
ValueCountFrequency (%)
tuesday 4332
14.4%
sunday 4319
14.4%
wednesday 4311
14.4%
thursday 4286
14.3%
monday 4282
14.3%
saturday 4281
14.3%
friday 4189
14.0%

Most occurring characters

ValueCountFrequency (%)
a 1142700
16.0%
d 1142121
16.0%
y 1000000
14.0%
u 571957
8.0%
r 428697
 
6.0%
s 428503
 
6.0%
n 427851
 
6.0%
e 427694
 
6.0%
T 286382
 
4.0%
S 285575
 
4.0%
Other values (7) 999595
14.0%
ValueCountFrequency (%)
d 34311
16.0%
a 34281
16.0%
y 30000
14.0%
u 17218
8.0%
e 12954
 
6.0%
s 12929
 
6.0%
n 12912
 
6.0%
r 12756
 
5.9%
T 8618
 
4.0%
S 8600
 
4.0%
Other values (7) 29820
13.9%

Most occurring categories, scripts and blocks

Group | Full Dataset | Stratified Sample
(unknown) | 7141075 (100.0%) | 214399 (100.0%)

Every character falls into a single (unknown) category, script and block, so the "Most frequent character per category", "per script" and "per block" tables are identical to the "Most occurring characters" table above.

week_of_year
Real number (ℝ)

Statistic | Full Dataset | Stratified Sample
Distinct | 52 | 52
Distinct (%) | < 0.1% | 0.2%
Missing | 0 | 0
Missing (%) | 0.0% | 0.0%
Infinite | 0 | 0
Infinite (%) | 0.0% | 0.0%
Mean | 26.503691 | 26.49406667
Minimum | 1 | 1
Maximum | 52 | 52
Zeros | 0 | 0
Zeros (%) | 0.0% | 0.0%
Negative | 0 | 0
Negative (%) | 0.0% | 0.0%
Memory size | 7.6 MiB | 234.5 KiB

Quantile statistics

Statistic | Full Dataset | Stratified Sample
Minimum | 1 | 1
5-th percentile | 3 | 3
Q1 | 14 | 13
median | 27 | 26
Q3 | 39 | 40
95-th percentile | 50 | 50
Maximum | 52 | 52
Range | 51 | 51
Interquartile range (IQR) | 25 | 27

Descriptive statistics

Statistic | Full Dataset | Stratified Sample
Standard deviation | 15.00516516 | 15.04609953
Coefficient of variation (CV) | 0.566153792 | 0.5679044941
Kurtosis | -1.199199248 | -1.207693619
Mean | 26.503691 | 26.49406667
Median Absolute Deviation (MAD) | 13 | 13
Skewness | -0.0005909978351 | 0.002932240747
Sum | 26503691 | 794822
Variance | 225.1549815 | 226.385111
Monotonicity | Not monotonic | Not monotonic

Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
27 19588
 
2.0%
19 19507
 
2.0%
51 19447
 
1.9%
26 19425
 
1.9%
1 19399
 
1.9%
25 19386
 
1.9%
21 19371
 
1.9%
44 19356
 
1.9%
16 19348
 
1.9%
9 19340
 
1.9%
Other values (42) 805833
80.6%
ValueCountFrequency (%)
10 624
 
2.1%
32 624
 
2.1%
19 620
 
2.1%
43 611
 
2.0%
47 606
 
2.0%
3 606
 
2.0%
21 604
 
2.0%
25 596
 
2.0%
17 595
 
2.0%
9 593
 
2.0%
Other values (42) 23921
79.7%
ValueCountFrequency (%)
1 19399
1.9%
2 19179
1.9%
3 19150
1.9%
4 19137
1.9%
5 19328
1.9%
ValueCountFrequency (%)
1 570
1.9%
2 592
2.0%
3 606
2.0%
4 556
1.9%
5 586
2.0%

month_of_year
Real number (ℝ)

Statistic | Full Dataset | Stratified Sample
Distinct | 12 | 12
Distinct (%) | < 0.1% | < 0.1%
Missing | 0 | 0
Missing (%) | 0.0% | 0.0%
Infinite | 0 | 0
Infinite (%) | 0.0% | 0.0%
Mean | 6.497467 | 6.4968
Minimum | 1 | 1
Maximum | 12 | 12
Zeros | 0 | 0
Zeros (%) | 0.0% | 0.0%
Negative | 0 | 0
Negative (%) | 0.0% | 0.0%
Memory size | 7.6 MiB | 234.5 KiB

Quantile statistics

Statistic | Full Dataset | Stratified Sample
Minimum | 1 | 1
5-th percentile | 1 | 1
Q1 | 3 | 3
median | 7 | 6
Q3 | 10 | 10
95-th percentile | 12 | 12
Maximum | 12 | 12
Range | 11 | 11
Interquartile range (IQR) | 7 | 7

Descriptive statistics

Statistic | Full Dataset | Stratified Sample
Standard deviation | 3.455211936 | 3.457695034
Coefficient of variation (CV) | 0.531778297 | 0.5322150957
Kurtosis | -1.21903855 | -1.218478263
Mean | 6.497467 | 6.4968
Median Absolute Deviation (MAD) | 3 | 3
Skewness | 0.0003820436436 | 0.002125819773
Sum | 6497467 | 194904
Variance | 11.93848952 | 11.95565495
Monotonicity | Not monotonic | Not monotonic

Histogram with fixed size bins (bins=12)
ValueCountFrequency (%)
2 83951
8.4%
11 83645
8.4%
1 83624
8.4%
7 83475
8.3%
12 83353
8.3%
9 83328
8.3%
3 83244
8.3%
5 83135
8.3%
10 83113
8.3%
8 83093
8.3%
Other values (2) 166039
16.6%
ValueCountFrequency (%)
10 2541
8.5%
2 2538
8.5%
7 2537
8.5%
6 2526
8.4%
12 2521
8.4%
1 2515
8.4%
5 2514
8.4%
11 2489
8.3%
4 2473
8.2%
3 2466
8.2%
Other values (2) 4880
16.3%
ValueCountFrequency (%)
1 83624
8.4%
2 83951
8.4%
3 83244
8.3%
4 83091
8.3%
5 83135
8.3%
ValueCountFrequency (%)
1 2515
8.4%
2 2538
8.5%
3 2466
8.2%
4 2473
8.2%
5 2514
8.4%

avg_purchase_value
Real number (ℝ)

Statistic | Full Dataset | Stratified Sample
Distinct | 49001 | 22451
Distinct (%) | 4.9% | 74.8%
Missing | 0 | 0
Missing (%) | 0.0% | 0.0%
Infinite | 0 | 0
Infinite (%) | 0.0% | 0.0%
Mean | 254.8864443 | 255.0930827
Minimum | 10 | 10.01
Maximum | 500 | 499.97
Zeros | 0 | 0
Zeros (%) | 0.0% | 0.0%
Negative | 0 | 0
Negative (%) | 0.0% | 0.0%
Memory size | 7.6 MiB | 234.5 KiB

Quantile statistics

Statistic | Full Dataset | Stratified Sample
Minimum | 10 | 10.01
5-th percentile | 34.4 | 33.79
Q1 | 132.22 | 131.975
median | 254.93 | 256.13
Q3 | 377.35 | 378.32
95-th percentile | 475.56 | 476.0405
Maximum | 500 | 499.97
Range | 490 | 489.96
Interquartile range (IQR) | 245.13 | 246.345

Descriptive statistics

Statistic | Full Dataset | Stratified Sample
Standard deviation | 141.4949233 | 142.102325
Coefficient of variation (CV) | 0.5551292604 | 0.5570606757
Kurtosis | -1.200170422 | -1.205574492
Mean | 254.8864443 | 255.0930827
Median Absolute Deviation (MAD) | 122.57 | 123.115
Skewness | 0.0003762833586 | -0.001960015767
Sum | 254886444.3 | 7652792.48
Variance | 20020.81333 | 20193.07077
Monotonicity | Not monotonic | Not monotonic

Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
76.54 41
 
< 0.1%
482.75 41
 
< 0.1%
372.04 40
 
< 0.1%
397.45 39
 
< 0.1%
246.87 39
 
< 0.1%
60.53 38
 
< 0.1%
278.34 38
 
< 0.1%
315.26 38
 
< 0.1%
165.81 38
 
< 0.1%
492.47 38
 
< 0.1%
Other values (48991) 999610
> 99.9%
ValueCountFrequency (%)
78.3 5
 
< 0.1%
194.92 5
 
< 0.1%
180.8 5
 
< 0.1%
228.66 5
 
< 0.1%
282.32 5
 
< 0.1%
158.4 5
 
< 0.1%
492.22 5
 
< 0.1%
373.44 5
 
< 0.1%
102.51 5
 
< 0.1%
78.13 5
 
< 0.1%
Other values (22441) 29950
99.8%
ValueCountFrequency (%)
10 8
 
< 0.1%
10.01 23
< 0.1%
10.02 29
< 0.1%
10.03 17
< 0.1%
10.04 21
< 0.1%
ValueCountFrequency (%)
10.01 2
< 0.1%
10.02 1
 
< 0.1%
10.03 1
 
< 0.1%
10.04 1
 
< 0.1%
10.05 3
< 0.1%
ValueCountFrequency (%)
10.01 2
< 0.1%
10.02 1
 
< 0.1%
10.03 1
 
< 0.1%
10.04 1
 
< 0.1%
10.05 3
< 0.1%
ValueCountFrequency (%)
10 8
 
< 0.1%
10.01 23
0.1%
10.02 29
0.1%
10.03 17
0.1%
10.04 21
0.1%

purchase_frequency
Text

Statistic | Full Dataset | Stratified Sample
Distinct | 4 | 4
Distinct (%) | < 0.1% | < 0.1%
Missing | 0 | 0
Missing (%) | 0.0% | 0.0%
Memory size | 7.6 MiB | 234.5 KiB

Length

Statistic | Full Dataset | Stratified Sample
Max length | 7 | 7
Median length | 6 | 6
Mean length | 6.000399 | 5.992
Min length | 5 | 5

Characters and Unicode

Statistic | Full Dataset | Stratified Sample
Total characters | 6000399 | 179760
Distinct characters | 15 | 15
Distinct categories | 1 | 1
Distinct scripts | 1 | 1
Distinct blocks | 1 | 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
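
As a rough illustration of what such character properties look like, the sketch below uses Python's standard `unicodedata` module to look up the general category of each character in the first five Full Dataset sample rows of this variable. It is an assumption-laden example rather than the profiler's implementation: the helper names are made up here, and the standard library exposes categories and names but not scripts or blocks.

```python
import unicodedata
from collections import Counter


def char_properties(text):
    """Return (character, general category, Unicode name) for each character."""
    return [
        (ch, unicodedata.category(ch), unicodedata.name(ch, "UNNAMED"))
        for ch in text
    ]


def category_counts(values):
    """Count characters per Unicode general category across all values."""
    counts = Counter()
    for value in values:
        counts.update(unicodedata.category(ch) for ch in value)
    return counts


# First five Full Dataset sample rows of purchase_frequency.
sample = ["Weekly", "Daily", "Weekly", "Weekly", "Yearly"]
counts = category_counts(sample)
print(counts)
print(char_properties("Daily"))
```

Here `Lu` stands for uppercase letters and `Ll` for lowercase letters.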

Unique

Statistic | Full Dataset | Stratified Sample
Unique | 0 | 0
Unique (%) | 0.0% | 0.0%

Sample

Row | Full Dataset | Stratified Sample
1st row | Weekly | Yearly
2nd row | Daily | Weekly
3rd row | Weekly | Weekly
4th row | Weekly | Weekly
5th row | Yearly | Monthly
ValueCountFrequency (%)
yearly 250767
25.1%
monthly 249932
25.0%
weekly 249768
25.0%
daily 249533
25.0%
ValueCountFrequency (%)
weekly 7579
25.3%
daily 7574
25.2%
yearly 7513
25.0%
monthly 7334
24.4%
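
The length and character totals reported for this variable can be cross-checked against the two frequency tables above. The sketch below is only a sanity check with hypothetical helper names: it multiplies each value's character length by its count and divides by the number of rows.

```python
# Counts copied from the frequency tables above (capitalised as in the sample rows).
full = {"Yearly": 250767, "Monthly": 249932, "Weekly": 249768, "Daily": 249533}
sample = {"Weekly": 7579, "Daily": 7574, "Yearly": 7513, "Monthly": 7334}


def length_summary(counts):
    """Return (rows, total characters, mean length) for a value -> count mapping."""
    rows = sum(counts.values())
    total_chars = sum(len(value) * count for value, count in counts.items())
    return rows, total_chars, total_chars / rows


full_rows, full_chars, full_mean = length_summary(full)
sample_rows, sample_chars, sample_mean = length_summary(sample)
print(full_rows, full_chars, full_mean)
print(sample_rows, sample_chars, sample_mean)
```

The totals agree with the reported Total characters (6000399 and 179760) and Mean length (6.000399 and 5.992).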

Most occurring characters

ValueCountFrequency (%)
l 1000000
16.7%
y 1000000
16.7%
e 750303
12.5%
a 500300
8.3%
Y 250767
 
4.2%
r 250767
 
4.2%
M 249932
 
4.2%
o 249932
 
4.2%
n 249932
 
4.2%
t 249932
 
4.2%
Other values (5) 1248534
20.8%
ValueCountFrequency (%)
l 30000
16.7%
y 30000
16.7%
e 22671
12.6%
a 15087
8.4%
W 7579
 
4.2%
k 7579
 
4.2%
D 7574
 
4.2%
i 7574
 
4.2%
Y 7513
 
4.2%
r 7513
 
4.2%
Other values (5) 36670
20.4%

Most occurring categories

ValueCountFrequency (%)
(unknown) 6000399
100.0%
ValueCountFrequency (%)
(unknown) 179760
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
l 1000000
16.7%
y 1000000
16.7%
e 750303
12.5%
a 500300
8.3%
Y 250767
 
4.2%
r 250767
 
4.2%
M 249932
 
4.2%
o 249932
 
4.2%
n 249932
 
4.2%
t 249932
 
4.2%
Other values (5) 1248534
20.8%
ValueCountFrequency (%)
l 30000
16.7%
y 30000
16.7%
e 22671
12.6%
a 15087
8.4%
W 7579
 
4.2%
k 7579
 
4.2%
D 7574
 
4.2%
i 7574
 
4.2%
Y 7513
 
4.2%
r 7513
 
4.2%
Other values (5) 36670
20.4%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 6000399
100.0%
ValueCountFrequency (%)
(unknown) 179760
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
l 1000000
16.7%
y 1000000
16.7%
e 750303
12.5%
a 500300
8.3%
Y 250767
 
4.2%
r 250767
 
4.2%
M 249932
 
4.2%
o 249932
 
4.2%
n 249932
 
4.2%
t 249932
 
4.2%
Other values (5) 1248534
20.8%
ValueCountFrequency (%)
l 30000
16.7%
y 30000
16.7%
e 22671
12.6%
a 15087
8.4%
W 7579
 
4.2%
k 7579
 
4.2%
D 7574
 
4.2%
i 7574
 
4.2%
Y 7513
 
4.2%
r 7513
 
4.2%
Other values (5) 36670
20.4%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 6000399
100.0%
ValueCountFrequency (%)
(unknown) 179760
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
l 1000000
16.7%
y 1000000
16.7%
e 750303
12.5%
a 500300
8.3%
Y 250767
 
4.2%
r 250767
 
4.2%
M 249932
 
4.2%
o 249932
 
4.2%
n 249932
 
4.2%
t 249932
 
4.2%
Other values (5) 1248534
20.8%
ValueCountFrequency (%)
l 30000
16.7%
y 30000
16.7%
e 22671
12.6%
a 15087
8.4%
W 7579
 
4.2%
k 7579
 
4.2%
D 7574
 
4.2%
i 7574
 
4.2%
Y 7513
 
4.2%
r 7513
 
4.2%
Other values (5) 36670
20.4%

last_purchase_date
Text

Statistic | Full Dataset | Stratified Sample
Distinct | 984242 | 29988
Distinct (%) | 98.4% | > 99.9%
Missing | 0 | 0
Missing (%) | 0.0% | 0.0%
Memory size | 7.6 MiB | 234.5 KiB

Length

Statistic | Full Dataset | Stratified Sample
Max length | 19 | 19
Median length | 19 | 19
Mean length | 19 | 19
Min length | 19 | 19

Characters and Unicode

Statistic | Full Dataset | Stratified Sample
Total characters | 19000000 | 570000
Distinct characters | 13 | 13
Distinct categories | 1 | 1
Distinct scripts | 1 | 1
Distinct blocks | 1 | 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Statistic | Full Dataset | Stratified Sample
Unique | 968656 | 29976
Unique (%) | 96.9% | 99.9%

Sample

Row | Full Dataset | Stratified Sample
1st row | 2021-09-11 04:22:38 | 2021-01-20 18:14:28
2nd row | 2021-05-16 12:01:16 | 2021-07-08 12:01:54
3rd row | 2021-02-07 16:47:48 | 2021-04-26 19:20:09
4th row | 2021-12-30 23:48:26 | 2021-02-19 10:14:53
5th row | 2021-11-02 11:48:25 | 2021-01-14 04:24:54
ValueCountFrequency (%)
2021-01-02 2870
 
0.1%
2021-05-14 2866
 
0.1%
2021-12-25 2860
 
0.1%
2021-01-17 2860
 
0.1%
2021-10-17 2856
 
0.1%
2021-01-26 2856
 
0.1%
2021-08-16 2854
 
0.1%
2021-05-05 2852
 
0.1%
2021-10-16 2850
 
0.1%
2021-09-17 2849
 
0.1%
Other values (86754) 1971427
98.6%
ValueCountFrequency (%)
2021-01-28 110
 
0.2%
2021-04-27 110
 
0.2%
2021-08-04 110
 
0.2%
2021-04-29 107
 
0.2%
2021-09-17 104
 
0.2%
2021-06-30 102
 
0.2%
2021-04-20 102
 
0.2%
2021-02-02 102
 
0.2%
2021-08-17 101
 
0.2%
2021-08-23 101
 
0.2%
Other values (25697) 58951
98.3%
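
The frequency tables above list bare dates, and the Full Dataset counts add up to 2,000,000, i.e. two tokens per row. That is consistent with the column having been profiled as whitespace-split text rather than as a timestamp. The sketch below shows one way the 19-character values could be parsed before analysis. It assumes the format `%Y-%m-%d %H:%M:%S`, inferred from the sample rows only, and `parse_purchase_date` is a hypothetical helper.

```python
from datetime import datetime

# Assumed layout, inferred from sample rows such as "2021-09-11 04:22:38".
FORMAT = "%Y-%m-%d %H:%M:%S"


def parse_purchase_date(value):
    """Parse one last_purchase_date string into a datetime object."""
    return datetime.strptime(value, FORMAT)


# First three Full Dataset sample rows.
rows = ["2021-09-11 04:22:38", "2021-05-16 12:01:16", "2021-02-07 16:47:48"]
parsed = [parse_purchase_date(r) for r in rows]

# Splitting on whitespace yields a date token and a time token per row.
tokens = [tok for r in rows for tok in r.split()]

print(parsed[0].isoformat())
print(tokens)
```

Parsed values can then be summarised by range or by calendar component instead of by character frequency.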

Most occurring characters

ValueCountFrequency (%)
2 3411647
18.0%
0 3298747
17.4%
1 2941758
15.5%
- 2000000
10.5%
: 2000000
10.5%
(space) 1000000
 
5.3%
3 891719
 
4.7%
5 800287
 
4.2%
4 796296
 
4.2%
7 466826
 
2.5%
Other values (3) 1392720
7.3%
ValueCountFrequency (%)
2 102631
18.0%
0 98938
17.4%
1 87968
15.4%
- 60000
10.5%
: 60000
10.5%
(space) 30000
 
5.3%
3 26805
 
4.7%
5 24014
 
4.2%
4 23890
 
4.2%
8 14026
 
2.5%
Other values (3) 41728
7.3%

Most occurring categories

ValueCountFrequency (%)
(unknown) 19000000
100.0%
ValueCountFrequency (%)
(unknown) 570000
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
2 3411647
18.0%
0 3298747
17.4%
1 2941758
15.5%
- 2000000
10.5%
: 2000000
10.5%
(space) 1000000
 
5.3%
3 891719
 
4.7%
5 800287
 
4.2%
4 796296
 
4.2%
7 466826
 
2.5%
Other values (3) 1392720
7.3%
ValueCountFrequency (%)
2 102631
18.0%
0 98938
17.4%
1 87968
15.4%
- 60000
10.5%
: 60000
10.5%
(space) 30000
 
5.3%
3 26805
 
4.7%
5 24014
 
4.2%
4 23890
 
4.2%
8 14026
 
2.5%
Other values (3) 41728
7.3%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 19000000
100.0%
ValueCountFrequency (%)
(unknown) 570000
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
2 3411647
18.0%
0 3298747
17.4%
1 2941758
15.5%
- 2000000
10.5%
: 2000000
10.5%
(space) 1000000
 
5.3%
3 891719
 
4.7%
5 800287
 
4.2%
4 796296
 
4.2%
7 466826
 
2.5%
Other values (3) 1392720
7.3%
ValueCountFrequency (%)
2 102631
18.0%
0 98938
17.4%
1 87968
15.4%
- 60000
10.5%
: 60000
10.5%
(space) 30000
 
5.3%
3 26805
 
4.7%
5 24014
 
4.2%
4 23890
 
4.2%
8 14026
 
2.5%
Other values (3) 41728
7.3%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 19000000
100.0%
ValueCountFrequency (%)
(unknown) 570000
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
2 3411647
18.0%
0 3298747
17.4%
1 2941758
15.5%
- 2000000
10.5%
: 2000000
10.5%
(space) 1000000
 
5.3%
3 891719
 
4.7%
5 800287
 
4.2%
4 796296
 
4.2%
7 466826
 
2.5%
Other values (3) 1392720
7.3%
ValueCountFrequency (%)
2 102631
18.0%
0 98938
17.4%
1 87968
15.4%
- 60000
10.5%
: 60000
10.5%
(space) 30000
 
5.3%
3 26805
 
4.7%
5 24014
 
4.2%
4 23890
 
4.2%
8 14026
 
2.5%
Other values (3) 41728
7.3%

avg_discount_used
Real number (ℝ)

Statistic | Full Dataset | Stratified Sample
Distinct | 51 | 51
Distinct (%) | < 0.1% | 0.2%
Missing | 0 | 0
Missing (%) | 0.0% | 0.0%
Infinite | 0 | 0
Infinite (%) | 0.0% | 0.0%
Mean | 0.25001009 | 0.249945
Minimum | 0 | 0
Maximum | 0.5 | 0.5
Zeros | 10010 | 294
Zeros (%) | 1.0% | 1.0%
Negative | 0 | 0
Negative (%) | 0.0% | 0.0%
Memory size | 7.6 MiB | 234.5 KiB

Quantile statistics

Statistic | Full Dataset | Stratified Sample
Minimum | 0 | 0
5-th percentile | 0.03 | 0.02
Q1 | 0.13 | 0.13
median | 0.25 | 0.25
Q3 | 0.38 | 0.37
95-th percentile | 0.47 | 0.47
Maximum | 0.5 | 0.5
Range | 0.5 | 0.5
Interquartile range (IQR) | 0.25 | 0.24

Descriptive statistics

Statistic | Full Dataset | Stratified Sample
Standard deviation | 0.1443825628 | 0.1436977683
Coefficient of variation (CV) | 0.5775069431 | 0.574917555
Kurtosis | -1.19810725 | -1.189045062
Mean | 0.25001009 | 0.249945
Median Absolute Deviation (MAD) | 0.12 | 0.12
Skewness | 0.0002818589406 | 0.002082182261
Sum | 250010.09 | 7498.35
Variance | 0.02084632444 | 0.02064904861
Monotonicity | Not monotonic | Not monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0.39 20194
 
2.0%
0.15 20188
 
2.0%
0.08 20140
 
2.0%
0.21 20138
 
2.0%
0.34 20131
 
2.0%
0.47 20125
 
2.0%
0.05 20124
 
2.0%
0.16 20123
 
2.0%
0.46 20109
 
2.0%
0.32 20093
 
2.0%
Other values (41) 798635
79.9%
ValueCountFrequency (%)
0.11 683
 
2.3%
0.21 668
 
2.2%
0.17 652
 
2.2%
0.34 644
 
2.1%
0.33 643
 
2.1%
0.31 638
 
2.1%
0.28 623
 
2.1%
0.41 623
 
2.1%
0.4 622
 
2.1%
0.43 619
 
2.1%
Other values (41) 23585
78.6%
ValueCountFrequency (%)
0 10010
1.0%
0.01 19893
2.0%
0.02 19951
2.0%
0.03 19949
2.0%
0.04 20004
2.0%
ValueCountFrequency (%)
0 294
1.0%
0.01 603
2.0%
0.02 607
2.0%
0.03 564
1.9%
0.04 570
1.9%
ValueCountFrequency (%)
0 294
< 0.1%
0.01 603
0.1%
0.02 607
0.1%
0.03 564
0.1%
0.04 570
0.1%
ValueCountFrequency (%)
0 10010
33.4%
0.01 19893
66.3%
0.02 19951
66.5%
0.03 19949
66.5%
0.04 20004
66.7%

preferred_store
Text

Statistic | Full Dataset | Stratified Sample
Distinct | 4 | 4
Distinct (%) | < 0.1% | < 0.1%
Missing | 0 | 0
Missing (%) | 0.0% | 0.0%
Memory size | 7.6 MiB | 234.5 KiB

Length

Statistic | Full Dataset | Stratified Sample
Max length | 10 | 10
Median length | 10 | 10
Mean length | 10 | 10
Min length | 10 | 10

Characters and Unicode

Statistic | Full Dataset | Stratified Sample
Total characters | 10000000 | 300000
Distinct characters | 12 | 12
Distinct categories | 1 | 1
Distinct scripts | 1 | 1
Distinct blocks | 1 | 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Statistic | Full Dataset | Stratified Sample
Unique | 0 | 0
Unique (%) | 0.0% | 0.0%

Sample

Row | Full Dataset | Stratified Sample
1st row | Location A | Location A
2nd row | Location C | Location B
3rd row | Location B | Location A
4th row | Location B | Location B
5th row | Location B | Location C
ValueCountFrequency (%)
location 1000000
50.0%
b 250262
 
12.5%
d 250007
 
12.5%
a 249949
 
12.5%
c 249782
 
12.5%
ValueCountFrequency (%)
location 30000
50.0%
a 7555
 
12.6%
b 7550
 
12.6%
d 7484
 
12.5%
c 7411
 
12.4%

Most occurring characters

ValueCountFrequency (%)
o 2000000
20.0%
L 1000000
10.0%
c 1000000
10.0%
a 1000000
10.0%
t 1000000
10.0%
i 1000000
10.0%
n 1000000
10.0%
(space) 1000000
10.0%
B 250262
 
2.5%
D 250007
 
2.5%
Other values (2) 499731
 
5.0%
ValueCountFrequency (%)
o 60000
20.0%
L 30000
10.0%
c 30000
10.0%
a 30000
10.0%
t 30000
10.0%
i 30000
10.0%
n 30000
10.0%
(space) 30000
10.0%
A 7555
 
2.5%
B 7550
 
2.5%
Other values (2) 14895
 
5.0%

Most occurring categories

ValueCountFrequency (%)
(unknown) 10000000
100.0%
ValueCountFrequency (%)
(unknown) 300000
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
o 2000000
20.0%
L 1000000
10.0%
c 1000000
10.0%
a 1000000
10.0%
t 1000000
10.0%
i 1000000
10.0%
n 1000000
10.0%
(space) 1000000
10.0%
B 250262
 
2.5%
D 250007
 
2.5%
Other values (2) 499731
 
5.0%
ValueCountFrequency (%)
o 60000
20.0%
L 30000
10.0%
c 30000
10.0%
a 30000
10.0%
t 30000
10.0%
i 30000
10.0%
n 30000
10.0%
(space) 30000
10.0%
A 7555
 
2.5%
B 7550
 
2.5%
Other values (2) 14895
 
5.0%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 10000000
100.0%
ValueCountFrequency (%)
(unknown) 300000
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
o 2000000
20.0%
L 1000000
10.0%
c 1000000
10.0%
a 1000000
10.0%
t 1000000
10.0%
i 1000000
10.0%
n 1000000
10.0%
(space) 1000000
10.0%
B 250262
 
2.5%
D 250007
 
2.5%
Other values (2) 499731
 
5.0%
ValueCountFrequency (%)
o 60000
20.0%
L 30000
10.0%
c 30000
10.0%
a 30000
10.0%
t 30000
10.0%
i 30000
10.0%
n 30000
10.0%
(space) 30000
10.0%
A 7555
 
2.5%
B 7550
 
2.5%
Other values (2) 14895
 
5.0%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 10000000
100.0%
ValueCountFrequency (%)
(unknown) 300000
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
o 2000000
20.0%
L 1000000
10.0%
c 1000000
10.0%
a 1000000
10.0%
t 1000000
10.0%
i 1000000
10.0%
n 1000000
10.0%
(space) 1000000
10.0%
B 250262
 
2.5%
D 250007
 
2.5%
Other values (2) 499731
 
5.0%
ValueCountFrequency (%)
o 60000
20.0%
L 30000
10.0%
c 30000
10.0%
a 30000
10.0%
t 30000
10.0%
i 30000
10.0%
n 30000
10.0%
(space) 30000
10.0%
A 7555
 
2.5%
B 7550
 
2.5%
Other values (2) 14895
 
5.0%

online_purchases
Real number (ℝ)

Statistic | Full Dataset | Stratified Sample
Distinct | 100 | 100
Distinct (%) | < 0.1% | 0.3%
Missing | 0 | 0
Missing (%) | 0.0% | 0.0%
Infinite | 0 | 0
Infinite (%) | 0.0% | 0.0%
Mean | 49.446018 | 49.47733333
Minimum | 0 | 0
Maximum | 99 | 99
Zeros | 9997 | 299
Zeros (%) | 1.0% | 1.0%
Negative | 0 | 0
Negative (%) | 0.0% | 0.0%
Memory size | 7.6 MiB | 234.5 KiB

Quantile statistics

Statistic | Full Dataset | Stratified Sample
Minimum | 0 | 0
5-th percentile | 4 | 4
Q1 | 24 | 25
median | 49 | 49
Q3 | 74 | 74
95-th percentile | 94 | 94
Maximum | 99 | 99
Range | 99 | 99
Interquartile range (IQR) | 50 | 49

Descriptive statistics

Statistic | Full Dataset | Stratified Sample
Standard deviation | 28.86143913 | 28.79320732
Coefficient of variation (CV) | 0.5836959234 | 0.581947437
Kurtosis | -1.200754846 | -1.193949208
Mean | 49.446018 | 49.47733333
Median Absolute Deviation (MAD) | 25 | 25
Skewness | 0.00143421854 | -0.00151205085
Sum | 49446018 | 1484320
Variance | 832.9826689 | 829.0487878
Monotonicity | Not monotonic | Not monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
4 10324
 
1.0%
28 10269
 
1.0%
40 10198
 
1.0%
67 10151
 
1.0%
76 10150
 
1.0%
61 10150
 
1.0%
52 10140
 
1.0%
88 10134
 
1.0%
43 10133
 
1.0%
45 10132
 
1.0%
Other values (90) 898219
89.8%
ValueCountFrequency (%)
33 337
 
1.1%
49 333
 
1.1%
51 331
 
1.1%
60 329
 
1.1%
6 329
 
1.1%
42 326
 
1.1%
28 326
 
1.1%
96 324
 
1.1%
38 324
 
1.1%
76 324
 
1.1%
Other values (90) 26717
89.1%
ValueCountFrequency (%)
0 9997
1.0%
1 10023
1.0%
2 9792
1.0%
3 10091
1.0%
4 10324
1.0%
ValueCountFrequency (%)
0 299
1.0%
1 301
1.0%
2 293
1.0%
3 322
1.1%
4 313
1.0%
ValueCountFrequency (%)
0 299
< 0.1%
1 301
< 0.1%
2 293
< 0.1%
3 322
< 0.1%
4 313
< 0.1%
ValueCountFrequency (%)
0 9997
33.3%
1 10023
33.4%
2 9792
32.6%
3 10091
33.6%
4 10324
34.4%

in_store_purchases
Real number (ℝ)

Statistic | Full Dataset | Stratified Sample
Distinct | 100 | 100
Distinct (%) | < 0.1% | 0.3%
Missing | 0 | 0
Missing (%) | 0.0% | 0.0%
Infinite | 0 | 0
Infinite (%) | 0.0% | 0.0%
Mean | 49.484486 | 49.26486667
Minimum | 0 | 0
Maximum | 99 | 99
Zeros | 10016 | 321
Zeros (%) | 1.0% | 1.1%
Negative | 0 | 0
Negative (%) | 0.0% | 0.0%
Memory size | 7.6 MiB | 234.5 KiB

Quantile statistics

Statistic | Full Dataset | Stratified Sample
Minimum | 0 | 0
5-th percentile | 5 | 5
Q1 | 24 | 24
median | 49 | 49
Q3 | 75 | 74
95-th percentile | 95 | 94
Maximum | 99 | 99
Range | 99 | 99
Interquartile range (IQR) | 51 | 50

Descriptive statistics

Statistic | Full Dataset | Stratified Sample
Standard deviation | 28.88271174 | 28.80988452
Coefficient of variation (CV) | 0.5836720572 | 0.5847957473
Kurtosis | -1.20140369 | -1.198586796
Mean | 49.484486 | 49.26486667
Median Absolute Deviation (MAD) | 25 | 25
Skewness | 0.00159043567 | 0.01270653627
Sum | 49484486 | 1477946
Variance | 834.2110375 | 830.009446
Monotonicity | Not monotonic | Not monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
38 10264
 
1.0%
30 10186
 
1.0%
86 10183
 
1.0%
10 10180
 
1.0%
14 10171
 
1.0%
7 10166
 
1.0%
13 10164
 
1.0%
50 10151
 
1.0%
67 10141
 
1.0%
91 10131
 
1.0%
Other values (90) 898263
89.8%
ValueCountFrequency (%)
71 336
 
1.1%
26 334
 
1.1%
31 334
 
1.1%
63 332
 
1.1%
32 329
 
1.1%
89 327
 
1.1%
67 325
 
1.1%
16 323
 
1.1%
79 323
 
1.1%
28 322
 
1.1%
Other values (90) 26715
89.0%
ValueCountFrequency (%)
0 10016
1.0%
1 9978
1.0%
2 9953
1.0%
3 9965
1.0%
4 9926
1.0%
ValueCountFrequency (%)
0 321
1.1%
1 284
0.9%
2 283
0.9%
3 316
1.1%
4 287
1.0%
ValueCountFrequency (%)
0 321
< 0.1%
1 284
< 0.1%
2 283
< 0.1%
3 316
< 0.1%
4 287
< 0.1%
ValueCountFrequency (%)
0 10016
33.4%
1 9978
33.3%
2 9953
33.2%
3 9965
33.2%
4 9926
33.1%

avg_items_per_transaction
Real number (ℝ)

Statistic | Full Dataset | Stratified Sample
Distinct | 901 | 901
Distinct (%) | 0.1% | 3.0%
Missing | 0 | 0
Missing (%) | 0.0% | 0.0%
Infinite | 0 | 0
Infinite (%) | 0.0% | 0.0%
Mean | 5.50312187 | 5.484473333
Minimum | 1 | 1
Maximum | 10 | 10
Zeros | 0 | 0
Zeros (%) | 0.0% | 0.0%
Negative | 0 | 0
Negative (%) | 0.0% | 0.0%
Memory size | 7.6 MiB | 234.5 KiB

Quantile statistics

Statistic | Full Dataset | Stratified Sample
Minimum | 1 | 1
5-th percentile | 1.45 | 1.43
Q1 | 3.26 | 3.23
median | 5.5 | 5.49
Q3 | 7.75 | 7.73
95-th percentile | 9.55 | 9.56
Maximum | 10 | 10
Range | 9 | 9
Interquartile range (IQR) | 4.49 | 4.5

Descriptive statistics

Statistic | Full Dataset | Stratified Sample
Standard deviation | 2.597661275 | 2.606511798
Coefficient of variation (CV) | 0.4720341173 | 0.4752528893
Kurtosis | -1.199082145 | -1.198518161
Mean | 5.50312187 | 5.484473333
Median Absolute Deviation (MAD) | 2.25 | 2.25
Skewness | -4.461054903 × 10⁻⁵ | 0.003201275435
Sum | 5503121.87 | 164534.2
Variance | 6.747844097 | 6.793903753
Monotonicity | Not monotonic | Not monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
3.49 1205
 
0.1%
5 1198
 
0.1%
3.94 1197
 
0.1%
6.41 1196
 
0.1%
2.82 1193
 
0.1%
8.41 1192
 
0.1%
9.69 1192
 
0.1%
4.29 1190
 
0.1%
4.35 1188
 
0.1%
6.14 1188
 
0.1%
Other values (891) 988061
98.8%
ValueCountFrequency (%)
5.89 59
 
0.2%
5.11 50
 
0.2%
9.59 48
 
0.2%
6.8 48
 
0.2%
2.22 48
 
0.2%
5.34 47
 
0.2%
1.49 46
 
0.2%
5.21 46
 
0.2%
8.13 46
 
0.2%
5.58 46
 
0.2%
Other values (891) 29516
98.4%
ValueCountFrequency (%)
1 514
0.1%
1.01 1135
0.1%
1.02 1105
0.1%
1.03 1122
0.1%
1.04 1067
0.1%
ValueCountFrequency (%)
1 17
0.1%
1.01 26
0.1%
1.02 42
0.1%
1.03 36
0.1%
1.04 30
0.1%
ValueCountFrequency (%)
1 17
< 0.1%
1.01 26
< 0.1%
1.02 42
< 0.1%
1.03 36
< 0.1%
1.04 30
< 0.1%
ValueCountFrequency (%)
1 514
1.7%
1.01 1135
3.8%
1.02 1105
3.7%
1.03 1122
3.7%
1.04 1067
3.6%

avg_transaction_value
Real number (ℝ)

Statistic | Full Dataset | Stratified Sample
Distinct | 49001 | 22408
Distinct (%) | 4.9% | 74.7%
Missing | 0 | 0
Missing (%) | 0.0% | 0.0%
Infinite | 0 | 0
Infinite (%) | 0.0% | 0.0%
Mean | 255.1157678 | 254.3950917
Minimum | 10 | 10
Maximum | 500 | 499.99
Zeros | 0 | 0
Zeros (%) | 0.0% | 0.0%
Negative | 0 | 0
Negative (%) | 0.0% | 0.0%
Memory size | 7.6 MiB | 234.5 KiB

Quantile statistics

Statistic | Full Dataset | Stratified Sample
Minimum | 10 | 10
5-th percentile | 34.52 | 34.1295
Q1 | 132.51 | 132.5075
median | 255.23 | 254.55
Q3 | 377.67 | 375.81
95-th percentile | 475.36 | 475.502
Maximum | 500 | 499.99
Range | 490 | 489.99
Interquartile range (IQR) | 245.16 | 243.3025

Descriptive statistics

Statistic | Full Dataset | Stratified Sample
Standard deviation | 141.4300141 | 141.2426008
Coefficient of variation (CV) | 0.5543758243 | 0.5552096146
Kurtosis | -1.200885422 | -1.191863439
Mean | 255.1157678 | 254.3950917
Median Absolute Deviation (MAD) | 122.58 | 121.64
Skewness | -0.001148163222 | 0.004421617675
Sum | 255115767.8 | 7631852.75
Variance | 20002.44888 | 19949.47228
Monotonicity | Not monotonic | Not monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
362.11 43
 
< 0.1%
157.68 43
 
< 0.1%
86.72 42
 
< 0.1%
303.99 41
 
< 0.1%
193.66 41
 
< 0.1%
112.64 40
 
< 0.1%
342.26 40
 
< 0.1%
454.72 39
 
< 0.1%
64.18 39
 
< 0.1%
280.55 39
 
< 0.1%
Other values (48991) 999593
> 99.9%
ValueCountFrequency (%)
262.56 6
 
< 0.1%
275.93 5
 
< 0.1%
359.56 5
 
< 0.1%
149.46 5
 
< 0.1%
153.96 5
 
< 0.1%
245.13 5
 
< 0.1%
194.86 5
 
< 0.1%
416.22 5
 
< 0.1%
338.17 5
 
< 0.1%
413.57 5
 
< 0.1%
Other values (22398) 29949
99.8%
ValueCountFrequency (%)
10 8
 
< 0.1%
10.01 18
< 0.1%
10.02 17
< 0.1%
10.03 28
< 0.1%
10.04 24
< 0.1%
ValueCountFrequency (%)
10 1
< 0.1%
10.03 1
< 0.1%
10.04 1
< 0.1%
10.05 1
< 0.1%
10.06 2
< 0.1%
ValueCountFrequency (%)
10 1
< 0.1%
10.03 1
< 0.1%
10.04 1
< 0.1%
10.05 1
< 0.1%
10.06 2
< 0.1%
ValueCountFrequency (%)
10 8
 
< 0.1%
10.01 18
0.1%
10.02 17
0.1%
10.03 28
0.1%
10.04 24
0.1%

total_returned_items
Real number (ℝ)

Statistic | Full Dataset | Stratified Sample
Distinct | 10 | 10
Distinct (%) | < 0.1% | < 0.1%
Missing | 0 | 0
Missing (%) | 0.0% | 0.0%
Infinite | 0 | 0
Infinite (%) | 0.0% | 0.0%
Mean | 4.498142 | 4.474033333
Minimum | 0 | 0
Maximum | 9 | 9
Zeros | 100060 | 3026
Zeros (%) | 10.0% | 10.1%
Negative | 0 | 0
Negative (%) | 0.0% | 0.0%
Memory size | 7.6 MiB | 234.5 KiB

Quantile statistics

Statistic | Full Dataset | Stratified Sample
Minimum | 0 | 0
5-th percentile | 0 | 0
Q1 | 2 | 2
median | 4 | 4
Q3 | 7 | 7
95-th percentile | 9 | 9
Maximum | 9 | 9
Range | 9 | 9
Interquartile range (IQR) | 5 | 5

Descriptive statistics

Statistic | Full Dataset | Stratified Sample
Standard deviation | 2.872805041 | 2.859463519
Coefficient of variation (CV) | 0.6386648177 | 0.6391243216
Kurtosis | -1.225109848 | -1.207499289
Mean | 4.498142 | 4.474033333
Median Absolute Deviation (MAD) | 3 | 2
Skewness | 0.0007692254728 | 0.01226172092
Sum | 4498142 | 134221
Variance | 8.253008801 | 8.176531617
Monotonicity | Not monotonic | Not monotonic
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%)
1 100298
10.0%
7 100190
10.0%
3 100119
10.0%
0 100060
10.0%
6 100004
10.0%
2 99991
10.0%
9 99942
10.0%
8 99838
10.0%
4 99821
10.0%
5 99737
10.0%
ValueCountFrequency (%)
3 3097
10.3%
5 3073
10.2%
4 3035
10.1%
0 3026
10.1%
6 3024
10.1%
2 2989
10.0%
1 2967
9.9%
9 2948
9.8%
7 2924
9.7%
8 2917
9.7%
ValueCountFrequency (%)
0 100060
10.0%
1 100298
10.0%
2 99991
10.0%
3 100119
10.0%
4 99821
10.0%
ValueCountFrequency (%)
0 3026
10.1%
1 2967
9.9%
2 2989
10.0%
3 3097
10.3%
4 3035
10.1%
ValueCountFrequency (%)
0 3026
0.3%
1 2967
0.3%
2 2989
0.3%
3 3097
0.3%
4 3035
0.3%
ValueCountFrequency (%)
0 100060
333.5%
1 100298
334.3%
2 99991
333.3%
3 100119
333.7%
4 99821
332.7%
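
In the first frequency tables above, each value from 0 to 9 holds roughly 10.0% of the rows, so the reported moments can be compared with those of a discrete uniform distribution on 0..9. The sketch below is an illustrative check under that assumption, not a statement about how the data was generated. It uses the closed-form mean, variance and excess kurtosis of a discrete uniform distribution, and the function name is hypothetical.

```python
def discrete_uniform_moments(low, high):
    """Mean, variance and excess kurtosis of a discrete uniform on low..high."""
    n = high - low + 1                      # number of equally likely values
    mean = (low + high) / 2
    variance = (n ** 2 - 1) / 12
    excess_kurtosis = -6 * (n ** 2 + 1) / (5 * (n ** 2 - 1))
    return mean, variance, excess_kurtosis


mean, variance, kurt = discrete_uniform_moments(0, 9)
print(mean, variance, kurt)

# Full Dataset values reported above, for comparison.
reported = {"mean": 4.498142, "variance": 8.253008801, "kurtosis": -1.225109848}
print(reported)
```

The theoretical mean of 4.5 and variance of 8.25 sit close to the reported 4.498142 and 8.253008801, and the theoretical excess kurtosis is likewise close to the reported -1.225109848.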

total_returned_value
Real number (ℝ)

Statistic | Full Dataset | Stratified Sample
Distinct | 99999 | 25888
Distinct (%) | 10.0% | 86.3%
Missing | 0 | 0
Missing (%) | 0.0% | 0.0%
Infinite | 0 | 0
Infinite (%) | 0.0% | 0.0%
Mean | 500.3878374 | 501.0130153
Minimum | 0 | 0.03
Maximum | 1000 | 1000
Zeros | 4 | 0
Zeros (%) | < 0.1% | 0.0%
Negative | 0 | 0
Negative (%) | 0.0% | 0.0%
Memory size | 7.6 MiB | 234.5 KiB

Quantile statistics

Statistic | Full Dataset | Stratified Sample
Minimum | 0 | 0.03
5-th percentile | 50 | 50.778
Q1 | 250.63 | 253.4025
median | 500.4 | 498.185
Q3 | 750.39 | 751.7325
95-th percentile | 950.22 | 949.7815
Maximum | 1000 | 1000
Range | 1000 | 999.97
Interquartile range (IQR) | 499.76 | 498.33

Descriptive statistics

Statistic | Full Dataset | Stratified Sample
Standard deviation | 288.7174763 | 288.2634295
Coefficient of variation (CV) | 0.5769873981 | 0.5753611597
Kurtosis | -1.199754459 | -1.200630145
Mean | 500.3878374 | 501.0130153
Median Absolute Deviation (MAD) | 249.89 | 249.22
Skewness | -0.001264828821 | 0.003471319382
Sum | 500387837.4 | 15030390.46
Variance | 83357.78115 | 83095.8048
Monotonicity | Not monotonic | Not monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
160.66 28
 
< 0.1%
467.66 26
 
< 0.1%
188.3 26
 
< 0.1%
488.88 25
 
< 0.1%
544.94 25
 
< 0.1%
651.87 25
 
< 0.1%
981.42 25
 
< 0.1%
330.91 25
 
< 0.1%
676.05 25
 
< 0.1%
227.59 25
 
< 0.1%
Other values (99989) 999745
> 99.9%
ValueCountFrequency (%)
260.36 6
 
< 0.1%
15.38 5
 
< 0.1%
108.29 5
 
< 0.1%
392.6 4
 
< 0.1%
282.57 4
 
< 0.1%
851.39 4
 
< 0.1%
815.1 4
 
< 0.1%
122.83 4
 
< 0.1%
251.53 4
 
< 0.1%
730.69 4
 
< 0.1%
Other values (25878) 29956
99.9%
ValueCountFrequency (%)
0 4
 
< 0.1%
0.01 13
< 0.1%
0.02 12
< 0.1%
0.03 11
< 0.1%
0.04 7
< 0.1%
ValueCountFrequency (%)
0.03 1
< 0.1%
0.05 1
< 0.1%
0.06 1
< 0.1%
0.11 1
< 0.1%
0.17 1
< 0.1%
ValueCountFrequency (%)
0.03 1
< 0.1%
0.05 1
< 0.1%
0.06 1
< 0.1%
0.11 1
< 0.1%
0.17 1
< 0.1%
ValueCountFrequency (%)
0 4
 
< 0.1%
0.01 13
< 0.1%
0.02 12
< 0.1%
0.03 11
< 0.1%
0.04 7
< 0.1%

total_sales
Real number (ℝ)

Statistic | Full Dataset | Stratified Sample
Distinct | 629254 | 29561
Distinct (%) | 62.9% | 98.5%
Missing | 0 | 0
Missing (%) | 0.0% | 0.0%
Infinite | 0 | 0
Infinite (%) | 0.0% | 0.0%
Mean | 5056.059765 | 5039.871729
Minimum | 100.01 | 100.06
Maximum | 9999.98 | 9999.78
Zeros | 0 | 0
Zeros (%) | 0.0% | 0.0%
Negative | 0 | 0
Negative (%) | 0.0% | 0.0%
Memory size | 7.6 MiB | 234.5 KiB

Quantile statistics

Statistic | Full Dataset | Stratified Sample
Minimum | 100.01 | 100.06
5-th percentile | 595.7095 | 585.243
Q1 | 2577.8675 | 2544.635
median | 5059.695 | 5029.3
Q3 | 7534.8025 | 7536.7425
95-th percentile | 9507.96 | 9496.1245
Maximum | 9999.98 | 9999.78
Range | 9899.97 | 9899.72
Interquartile range (IQR) | 4956.935 | 4992.1075

Descriptive statistics

Statistic | Full Dataset | Stratified Sample
Standard deviation | 2859.100058 | 2862.248413
Coefficient of variation (CV) | 0.5654798777 | 0.5679208851
Kurtosis | -1.201132214 | -1.207616505
Mean | 5056.059765 | 5039.871729
Median Absolute Deviation (MAD) | 2478.365 | 2495.845
Skewness | -0.002792355347 | 0.001697316071
Sum | 5056059765 | 151196151.9
Variance | 8174453.14 | 8192465.978
Monotonicity | Not monotonic | Not monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
9263.29 8
 
< 0.1%
1070.51 8
 
< 0.1%
8973.11 8
 
< 0.1%
7882.97 8
 
< 0.1%
630.03 8
 
< 0.1%
8669.59 8
 
< 0.1%
8191.02 8
 
< 0.1%
2558.91 8
 
< 0.1%
5572.95 8
 
< 0.1%
8266.95 8
 
< 0.1%
Other values (629244) 999920
> 99.9%
ValueCountFrequency (%)
9287.32 3
 
< 0.1%
5343.21 3
 
< 0.1%
8517.35 2
 
< 0.1%
6020.86 2
 
< 0.1%
5550.99 2
 
< 0.1%
3278.77 2
 
< 0.1%
3372.59 2
 
< 0.1%
7783.93 2
 
< 0.1%
569.43 2
 
< 0.1%
7306.93 2
 
< 0.1%
Other values (29551) 29978
99.9%
ValueCountFrequency (%)
100.01 2
< 0.1%
100.02 2
< 0.1%
100.04 2
< 0.1%
100.05 2
< 0.1%
100.06 3
< 0.1%
ValueCountFrequency (%)
100.06 1
< 0.1%
100.14 1
< 0.1%
100.38 1
< 0.1%
100.48 1
< 0.1%
101.81 1
< 0.1%
ValueCountFrequency (%)
100.06 1
< 0.1%
100.14 1
< 0.1%
100.38 1
< 0.1%
100.48 1
< 0.1%
101.81 1
< 0.1%
ValueCountFrequency (%)
100.01 2
< 0.1%
100.02 2
< 0.1%
100.04 2
< 0.1%
100.05 2
< 0.1%
100.06 3
< 0.1%

total_transactions
Real number (ℝ)

Statistic | Full Dataset | Stratified Sample
Distinct | 99 | 99
Distinct (%) | < 0.1% | 0.3%
Missing | 0 | 0
Missing (%) | 0.0% | 0.0%
Infinite | 0 | 0
Infinite (%) | 0.0% | 0.0%
Mean | 49.987386 | 49.79873333
Minimum | 1 | 1
Maximum | 99 | 99
Zeros | 0 | 0
Zeros (%) | 0.0% | 0.0%
Negative | 0 | 0
Negative (%) | 0.0% | 0.0%
Memory size | 7.6 MiB | 234.5 KiB

Quantile statistics

Statistic | Full Dataset | Stratified Sample
Minimum | 1 | 1
5-th percentile | 5 | 5
Q1 | 25 | 25
median | 50 | 49
Q3 | 75 | 75
95-th percentile | 95 | 95
Maximum | 99 | 99
Range | 98 | 98
Interquartile range (IQR) | 50 | 50

Descriptive statistics

Statistic | Full Dataset | Stratified Sample
Standard deviation | 28.57168895 | 28.65419796
Coefficient of variation (CV) | 0.5715779766 | 0.5754001365
Kurtosis | -1.200697232 | -1.206897219
Mean | 49.987386 | 49.79873333
Median Absolute Deviation (MAD) | 25 | 25
Skewness | 6.496950968 × 10⁻⁵ | 0.01136407827
Sum | 49987386 | 1493962
Variance | 816.3414092 | 821.0630605
Monotonicity | Not monotonic | Not monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
93 10385
 
1.0%
24 10328
 
1.0%
61 10316
 
1.0%
70 10306
 
1.0%
49 10290
 
1.0%
14 10280
 
1.0%
83 10278
 
1.0%
27 10251
 
1.0%
16 10247
 
1.0%
75 10245
 
1.0%
Other values (89) 897074
89.7%
ValueCountFrequency (%)
34 342
 
1.1%
41 338
 
1.1%
62 329
 
1.1%
16 329
 
1.1%
36 329
 
1.1%
13 328
 
1.1%
11 327
 
1.1%
70 327
 
1.1%
83 325
 
1.1%
47 323
 
1.1%
Other values (89) 26703
89.0%
ValueCountFrequency (%)
1 10053
1.0%
2 10174
1.0%
3 10113
1.0%
4 10133
1.0%
5 10140
1.0%
ValueCountFrequency (%)
1 311
1.0%
2 302
1.0%
3 310
1.0%
4 289
1.0%
5 306
1.0%
ValueCountFrequency (%)
1 311
< 0.1%
2 302
< 0.1%
3 310
< 0.1%
4 289
< 0.1%
5 306
< 0.1%
ValueCountFrequency (%)
1 10053
33.5%
2 10174
33.9%
3 10113
33.7%
4 10133
33.8%
5 10140
33.8%

total_items_purchased
Real number (ℝ)

Statistic | Full Dataset | Stratified Sample
Distinct | 499 | 499
Distinct (%) | < 0.1% | 1.7%
Missing | 0 | 0
Missing (%) | 0.0% | 0.0%
Infinite | 0 | 0
Infinite (%) | 0.0% | 0.0%
Mean | 250.042763 | 251.2563333
Minimum | 1 | 1
Maximum | 499 | 499
Zeros | 0 | 0
Zeros (%) | 0.0% | 0.0%
Negative | 0 | 0
Negative (%) | 0.0% | 0.0%
Memory size | 7.6 MiB | 234.5 KiB

Quantile statistics

Statistic | Full Dataset | Stratified Sample
Minimum | 1 | 1
5-th percentile | 26 | 26
Q1 | 125 | 126
median | 250 | 252
Q3 | 375 | 377
95-th percentile | 475 | 475
Maximum | 499 | 499
Range | 498 | 498
Interquartile range (IQR) | 250 | 251

Descriptive statistics

Statistic | Full Dataset | Stratified Sample
Standard deviation | 143.9845462 | 144.3084035
Coefficient of variation (CV) | 0.5758396862 | 0.5743473273
Kurtosis | -1.199364571 | -1.201958267
Mean | 250.042763 | 251.2563333
Median Absolute Deviation (MAD) | 125 | 125
Skewness | -0.0005289537985 | -0.008483461714
Sum | 250042763 | 7537690
Variance | 20731.54954 | 20824.91532
Monotonicity | Not monotonic | Not monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
282 2156
 
0.2%
285 2146
 
0.2%
355 2132
 
0.2%
459 2099
 
0.2%
296 2098
 
0.2%
241 2096
 
0.2%
413 2090
 
0.2%
331 2088
 
0.2%
425 2087
 
0.2%
260 2086
 
0.2%
Other values (489) 978922
97.9%
ValueCountFrequency (%)
199 84
 
0.3%
340 82
 
0.3%
263 80
 
0.3%
352 77
 
0.3%
457 76
 
0.3%
471 76
 
0.3%
225 76
 
0.3%
121 76
 
0.3%
29 76
 
0.3%
283 75
 
0.2%
Other values (489) 29222
97.4%
ValueCountFrequency (%)
1 2005
0.2%
2 2077
0.2%
3 1999
0.2%
4 2019
0.2%
5 1988
0.2%
ValueCountFrequency (%)
1 63
0.2%
2 69
0.2%
3 51
0.2%
4 65
0.2%
5 46
0.2%
ValueCountFrequency (%)
1 63
< 0.1%
2 69
< 0.1%
3 51
< 0.1%
4 65
< 0.1%
5 46
< 0.1%
ValueCountFrequency (%)
1 2005
6.7%
2 2077
6.9%
3 1999
6.7%
4 2019
6.7%
5 1988
6.6%

total_discounts_received
Real number (ℝ)

Statistic | Full Dataset | Stratified Sample
Distinct | 99995 | 25951
Distinct (%) | 10.0% | 86.5%
Missing | 0 | 0
Missing (%) | 0.0% | 0.0%
Infinite | 0 | 0
Infinite (%) | 0.0% | 0.0%
Mean | 499.6743882 | 499.1838027
Minimum | 0 | 0.02
Maximum | 1000 | 999.88
Zeros | 6 | 0
Zeros (%) | < 0.1% | 0.0%
Negative | 0 | 0
Negative (%) | 0.0% | 0.0%
Memory size | 7.6 MiB | 234.5 KiB

Quantile statistics

Statistic | Full Dataset | Stratified Sample
Minimum | 0 | 0.02
5-th percentile | 50.16 | 49.6795
Q1 | 249.76 | 248.7875
median | 499.51 | 498.065
Q3 | 749.54 | 751.8575
95-th percentile | 949.66 | 951.02
Maximum | 1000 | 999.88
Range | 1000 | 999.86
Interquartile range (IQR) | 499.78 | 503.07

Descriptive statistics

Statistic | Full Dataset | Stratified Sample
Standard deviation | 288.5791016 | 289.4257723
Coefficient of variation (CV) | 0.5775343071 | 0.5797980039
Kurtosis | -1.200167414 | -1.207019765
Mean | 499.6743882 | 499.1838027
Median Absolute Deviation (MAD) | 249.9 | 251.37
Skewness | 0.0009745010535 | 0.001267690242
Sum | 499674388.2 | 14975514.08
Variance | 83277.89786 | 83767.2777
Monotonicity | Not monotonic | Not monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
52.87 26
 
< 0.1%
811.21 26
 
< 0.1%
721.58 26
 
< 0.1%
418.88 25
 
< 0.1%
760.97 25
 
< 0.1%
406.5 24
 
< 0.1%
784.58 24
 
< 0.1%
595.87 24
 
< 0.1%
918 24
 
< 0.1%
34.86 24
 
< 0.1%
Other values (99985) 999752
> 99.9%
ValueCountFrequency (%)
236.75 5
 
< 0.1%
236.37 5
 
< 0.1%
157.68 4
 
< 0.1%
240.47 4
 
< 0.1%
643.15 4
 
< 0.1%
28.71 4
 
< 0.1%
557.58 4
 
< 0.1%
44.76 4
 
< 0.1%
788.39 4
 
< 0.1%
234.73 4
 
< 0.1%
Other values (25941) 29958
99.9%
ValueCountFrequency (%)
0 6
< 0.1%
0.01 13
< 0.1%
0.02 8
< 0.1%
0.03 8
< 0.1%
0.04 6
< 0.1%
ValueCountFrequency (%)
0.02 1
< 0.1%
0.03 1
< 0.1%
0.05 1
< 0.1%
0.06 1
< 0.1%
0.13 1
< 0.1%

avg_spent_per_category
Real number (ℝ)

Statistic | Full Dataset | Stratified Sample
Distinct | 98999 | 25850
Distinct (%) | 9.9% | 86.2%
Missing | 0 | 0
Missing (%) | 0.0% | 0.0%
Infinite | 0 | 0
Infinite (%) | 0.0% | 0.0%
Mean | 505.1754779 | 504.7207147
Minimum | 10 | 10.02
Maximum | 1000 | 1000
Zeros | 0 | 0
Zeros (%) | 0.0% | 0.0%
Negative | 0 | 0
Negative (%) | 0.0% | 0.0%
Memory size | 7.6 MiB | 234.5 KiB

Quantile statistics

Statistic | Full Dataset | Stratified Sample
Minimum | 10 | 10.02
5-th percentile | 59.49 | 59.958
Q1 | 257.24 | 257.0825
median | 505.14 | 501.885
Q3 | 753.06 | 753.1025
95-th percentile | 950.7405 | 951.441
Maximum | 1000 | 1000
Range | 990 | 989.98
Interquartile range (IQR) | 495.82 | 496.02

Descriptive statistics

Statistic | Full Dataset | Stratified Sample
Standard deviation | 286.0591784 | 286.2765849
Coefficient of variation (CV) | 0.566257055 | 0.5671980099
Kurtosis | -1.201963641 | -1.201876857
Mean | 505.1754779 | 504.7207147
Median Absolute Deviation (MAD) | 247.91 | 247.985
Skewness | -0.0002454959133 | 0.008955884632
Sum | 505175477.9 | 15141621.44
Variance | 81829.85355 | 81954.28307
Monotonicity | Not monotonic | Not monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
202.69 27
 
< 0.1%
969.16 26
 
< 0.1%
582.24 26
 
< 0.1%
806.1 25
 
< 0.1%
330.74 25
 
< 0.1%
798.54 25
 
< 0.1%
299.29 25
 
< 0.1%
312.38 25
 
< 0.1%
825.53 24
 
< 0.1%
525.28 24
 
< 0.1%
Other values (98989) 999748
> 99.9%
ValueCountFrequency (%)
174.08 5
 
< 0.1%
112.13 5
 
< 0.1%
966.69 4
 
< 0.1%
866.33 4
 
< 0.1%
65.43 4
 
< 0.1%
628.89 4
 
< 0.1%
35.16 4
 
< 0.1%
803.28 4
 
< 0.1%
99.97 4
 
< 0.1%
344.54 4
 
< 0.1%
Other values (25840) 29958
99.9%
ValueCountFrequency (%)
10 4
 
< 0.1%
10.01 8
< 0.1%
10.02 13
< 0.1%
10.03 10
< 0.1%
10.04 13
< 0.1%
ValueCountFrequency (%)
10.02 2
< 0.1%
10.04 1
< 0.1%
10.05 1
< 0.1%
10.15 1
< 0.1%
10.32 1
< 0.1%

max_single_purchase_value
Real number (ℝ)

Statistic | Full Dataset | Stratified Sample
Distinct | 99001 | 25798
Distinct (%) | 9.9% | 86.0%
Missing | 0 | 0
Missing (%) | 0.0% | 0.0%
Infinite | 0 | 0
Infinite (%) | 0.0% | 0.0%
Mean | 505.0014045 | 501.5595277
Minimum | 10 | 10.04
Maximum | 1000 | 999.99
Zeros | 0 | 0
Zeros (%) | 0.0% | 0.0%
Negative | 0 | 0
Negative (%) | 0.0% | 0.0%
Memory size | 7.6 MiB | 234.5 KiB

Quantile statistics

Statistic | Full Dataset | Stratified Sample
Minimum | 10 | 10.04
5-th percentile | 59.3 | 58.76
Q1 | 256.84 | 251.2725
median | 505.22 | 499.845
Q3 | 753.21 | 750.16
95-th percentile | 950.55 | 951.283
Maximum | 1000 | 999.99
Range | 990 | 989.95
Interquartile range (IQR) | 496.37 | 498.8875

Descriptive statistics

Statistic | Full Dataset | Stratified Sample
Standard deviation | 286.0733241 | 286.5305105
Coefficient of variation (CV) | 0.5664802545 | 0.5712791698
Kurtosis | -1.202495075 | -1.210325642
Mean | 505.0014045 | 501.5595277
Median Absolute Deviation (MAD) | 248.18 | 249.455
Skewness | -0.0008466890807 | 0.01889523734
Sum | 505001404.5 | 15046785.83
Variance | 81837.94677 | 82099.73347
Monotonicity | Not monotonic | Not monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
575.57 28
 
< 0.1%
461.6 26
 
< 0.1%
874.29 25
 
< 0.1%
105.78 25
 
< 0.1%
354.85 25
 
< 0.1%
736.87 25
 
< 0.1%
439.72 25
 
< 0.1%
893.75 24
 
< 0.1%
179.32 24
 
< 0.1%
330.94 24
 
< 0.1%
Other values (98991) 999749
> 99.9%
ValueCountFrequency (%)
261.06 5
 
< 0.1%
39.12 5
 
< 0.1%
769.44 4
 
< 0.1%
99.25 4
 
< 0.1%
879.12 4
 
< 0.1%
883.42 4
 
< 0.1%
507.84 4
 
< 0.1%
548.53 4
 
< 0.1%
570.99 4
 
< 0.1%
668 4
 
< 0.1%
Other values (25788) 29958
99.9%
ValueCountFrequency (%)
10 6
 
< 0.1%
10.01 8
< 0.1%
10.02 15
< 0.1%
10.03 5
 
< 0.1%
10.04 15
< 0.1%
ValueCountFrequency (%)
10.04 1
< 0.1%
10.08 1
< 0.1%
10.09 2
< 0.1%
10.1 1
< 0.1%
10.11 1
< 0.1%

min_single_purchase_value
Real number (ℝ)

Statistic | Full Dataset | Stratified Sample
Distinct | 991 | 991
Distinct (%) | 0.1% | 3.3%
Missing | 0 | 0
Missing (%) | 0.0% | 0.0%
Infinite | 0 | 0
Infinite (%) | 0.0% | 0.0%
Mean | 5.04384896 | 5.048166667
Minimum | 0.1 | 0.1
Maximum | 10 | 10
Zeros | 0 | 0
Zeros (%) | 0.0% | 0.0%
Negative | 0 | 0
Negative (%) | 0.0% | 0.0%
Memory size | 7.6 MiB | 234.5 KiB

Quantile statistics

Statistic | Full Dataset | Stratified Sample
Minimum | 0.1 | 0.1
5-th percentile | 0.59 | 0.61
Q1 | 2.57 | 2.57
median | 5.04 | 5.06
Q3 | 7.51 | 7.51
95-th percentile | 9.5 | 9.47
Maximum | 10 | 10
Range | 9.9 | 9.9
Interquartile range (IQR) | 4.94 | 4.94

Descriptive statistics

Statistic | Full Dataset | Stratified Sample
Standard deviation | 2.855904644 | 2.845148112
Coefficient of variation (CV) | 0.566215338 | 0.5636002732
Kurtosis | -1.198193882 | -1.201234009
Mean | 5.04384896 | 5.048166667
Median Absolute Deviation (MAD) | 2.47 | 2.47
Skewness | 0.002415403507 | -0.00852735842
Sum | 5043848.96 | 151445
Variance | 8.156191335 | 8.094867781
Monotonicity | Not monotonic | Not monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
4.66 1123
 
0.1%
0.29 1112
 
0.1%
3.05 1110
 
0.1%
1.67 1101
 
0.1%
4.67 1092
 
0.1%
6.93 1092
 
0.1%
6.14 1091
 
0.1%
5.19 1091
 
0.1%
5.31 1086
 
0.1%
5.02 1086
 
0.1%
Other values (981) 989016
98.9%
ValueCountFrequency (%)
0.19 48
 
0.2%
1.31 44
 
0.1%
2.78 44
 
0.1%
8.09 44
 
0.1%
5.02 43
 
0.1%
0.97 43
 
0.1%
2.2 43
 
0.1%
1.06 43
 
0.1%
6.16 43
 
0.1%
5.81 43
 
0.1%
Other values (981) 29562
98.5%
ValueCountFrequency (%)
0.1 491
< 0.1%
0.11 1041
0.1%
0.12 1011
0.1%
0.13 1044
0.1%
0.14 1013
0.1%
ValueCountFrequency (%)
0.1 15
 
0.1%
0.11 31
0.1%
0.12 25
0.1%
0.13 38
0.1%
0.14 18
0.1%

product_name
Text

Statistic | Full Dataset | Stratified Sample
Distinct | 4 | 4
Distinct (%) | < 0.1% | < 0.1%
Missing | 0 | 0
Missing (%) | 0.0% | 0.0%
Memory size | 7.6 MiB | 234.5 KiB

Length

Statistic | Full Dataset | Stratified Sample
Max length | 9 | 9
Median length | 9 | 9
Mean length | 9 | 9
Min length | 9 | 9

Characters and Unicode

Statistic | Full Dataset | Stratified Sample
Total characters | 9000000 | 270000
Distinct characters | 12 | 12
Distinct categories | 1 | 1
Distinct scripts | 1 | 1
Distinct blocks | 1 | 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
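
As an illustration of what such character properties look like, the sketch below looks up the Unicode general category of each character with Python's standard `unicodedata` module. This is not the report's own implementation (the tables in this section list the category, script, and block as "(unknown)", so the report did not resolve them); the function name `category_counts` is a hypothetical helper.

```python
import unicodedata
from collections import Counter

def category_counts(text: str) -> Counter:
    """Count characters by Unicode general category (e.g. Lu, Ll, Zs)."""
    return Counter(unicodedata.category(ch) for ch in text)

# "Product D" is one of the sample values of product_name.
counts = category_counts("Product D")
print(dict(counts))
```

Here `Lu` is an uppercase letter, `Ll` a lowercase letter, and `Zs` a space separator.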

Unique

Statistic | Full Dataset | Stratified Sample
Unique | 0 | 0
Unique (%) | 0.0% | 0.0%

Sample

Row | Full Dataset | Stratified Sample
1st row | Product D | Product D
2nd row | Product C | Product B
3rd row | Product B | Product A
4th row | Product A | Product B
5th row | Product C | Product C
ValueCountFrequency (%)
product 1000000
50.0%
b 250375
 
12.5%
c 249957
 
12.5%
a 249928
 
12.5%
d 249740
 
12.5%
ValueCountFrequency (%)
product 30000
50.0%
b 7575
 
12.6%
d 7556
 
12.6%
c 7441
 
12.4%
a 7428
 
12.4%
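
The word tables above count tokens, not rows: each value such as "Product B" contributes two words, so "product" accounts for 50.0% of the 2,000,000 tokens in the full dataset and each letter for roughly 12.5%. A minimal sketch of that counting on the five full-dataset sample rows, assuming words are lowercased and split on whitespace (the report's exact tokenization is not shown):

```python
from collections import Counter

# The five full-dataset sample rows of product_name.
rows = ["Product D", "Product C", "Product B", "Product A", "Product C"]

# Assumption: words are lowercased and split on whitespace before counting.
words = Counter(word for row in rows for word in row.lower().split())
total = sum(words.values())

for word, count in words.most_common():
    print(f"{word} {count} {100 * count / total:.1f}%")
```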

Most occurring characters

ValueCountFrequency (%)
P 1000000
11.1%
r 1000000
11.1%
o 1000000
11.1%
d 1000000
11.1%
u 1000000
11.1%
c 1000000
11.1%
t 1000000
11.1%
1000000
11.1%
B 250375
 
2.8%
C 249957
 
2.8%
Other values (2) 499668
5.6%
ValueCountFrequency (%)
P 30000
11.1%
r 30000
11.1%
o 30000
11.1%
d 30000
11.1%
u 30000
11.1%
c 30000
11.1%
t 30000
11.1%
30000
11.1%
B 7575
 
2.8%
D 7556
 
2.8%
Other values (2) 14869
5.5%

Most occurring categories

ValueCountFrequency (%)
(unknown) 9000000
100.0%
ValueCountFrequency (%)
(unknown) 270000
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
P 1000000
11.1%
r 1000000
11.1%
o 1000000
11.1%
d 1000000
11.1%
u 1000000
11.1%
c 1000000
11.1%
t 1000000
11.1%
1000000
11.1%
B 250375
 
2.8%
C 249957
 
2.8%
Other values (2) 499668
5.6%
ValueCountFrequency (%)
P 30000
11.1%
r 30000
11.1%
o 30000
11.1%
d 30000
11.1%
u 30000
11.1%
c 30000
11.1%
t 30000
11.1%
30000
11.1%
B 7575
 
2.8%
D 7556
 
2.8%
Other values (2) 14869
5.5%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 9000000
100.0%
ValueCountFrequency (%)
(unknown) 270000
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
P 1000000
11.1%
r 1000000
11.1%
o 1000000
11.1%
d 1000000
11.1%
u 1000000
11.1%
c 1000000
11.1%
t 1000000
11.1%
1000000
11.1%
B 250375
 
2.8%
C 249957
 
2.8%
Other values (2) 499668
5.6%
ValueCountFrequency (%)
P 30000
11.1%
r 30000
11.1%
o 30000
11.1%
d 30000
11.1%
u 30000
11.1%
c 30000
11.1%
t 30000
11.1%
30000
11.1%
B 7575
 
2.8%
D 7556
 
2.8%
Other values (2) 14869
5.5%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 9000000
100.0%
ValueCountFrequency (%)
(unknown) 270000
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
P 1000000
11.1%
r 1000000
11.1%
o 1000000
11.1%
d 1000000
11.1%
u 1000000
11.1%
c 1000000
11.1%
t 1000000
11.1%
1000000
11.1%
B 250375
 
2.8%
C 249957
 
2.8%
Other values (2) 499668
5.6%
ValueCountFrequency (%)
P 30000
11.1%
r 30000
11.1%
o 30000
11.1%
d 30000
11.1%
u 30000
11.1%
c 30000
11.1%
t 30000
11.1%
30000
11.1%
B 7575
 
2.8%
D 7556
 
2.8%
Other values (2) 14869
5.5%

product_brand
Text

Statistic | Full Dataset | Stratified Sample
Distinct | 3 | 3
Distinct (%) | < 0.1% | < 0.1%
Missing | 0 | 0
Missing (%) | 0.0% | 0.0%
Memory size | 7.6 MiB | 234.5 KiB

Length

Statistic | Full Dataset | Stratified Sample
Max length | 7 | 7
Median length | 7 | 7
Mean length | 7 | 7
Min length | 7 | 7

Characters and Unicode

Statistic | Full Dataset | Stratified Sample
Total characters | 7000000 | 210000
Distinct characters | 9 | 9
Distinct categories | 1 | 1
Distinct scripts | 1 | 1
Distinct blocks | 1 | 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Statistic | Full Dataset | Stratified Sample
Unique | 0 | 0
Unique (%) | 0.0% | 0.0%

Sample

Row | Full Dataset | Stratified Sample
1st row | Brand Y | Brand Z
2nd row | Brand X | Brand Z
3rd row | Brand X | Brand X
4th row | Brand Z | Brand Y
5th row | Brand X | Brand X
ValueCountFrequency (%)
brand 1000000
50.0%
y 333775
 
16.7%
z 333608
 
16.7%
x 332617
 
16.6%
ValueCountFrequency (%)
brand 30000
50.0%
z 10045
 
16.7%
y 10035
 
16.7%
x 9920
 
16.5%

Most occurring characters

ValueCountFrequency (%)
B 1000000
14.3%
r 1000000
14.3%
a 1000000
14.3%
n 1000000
14.3%
d 1000000
14.3%
1000000
14.3%
Y 333775
 
4.8%
Z 333608
 
4.8%
X 332617
 
4.8%
ValueCountFrequency (%)
B 30000
14.3%
r 30000
14.3%
a 30000
14.3%
n 30000
14.3%
d 30000
14.3%
30000
14.3%
Z 10045
 
4.8%
Y 10035
 
4.8%
X 9920
 
4.7%

Most occurring categories

ValueCountFrequency (%)
(unknown) 7000000
100.0%
ValueCountFrequency (%)
(unknown) 210000
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
B 1000000
14.3%
r 1000000
14.3%
a 1000000
14.3%
n 1000000
14.3%
d 1000000
14.3%
1000000
14.3%
Y 333775
 
4.8%
Z 333608
 
4.8%
X 332617
 
4.8%
ValueCountFrequency (%)
B 30000
14.3%
r 30000
14.3%
a 30000
14.3%
n 30000
14.3%
d 30000
14.3%
30000
14.3%
Z 10045
 
4.8%
Y 10035
 
4.8%
X 9920
 
4.7%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 7000000
100.0%
ValueCountFrequency (%)
(unknown) 210000
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
B 1000000
14.3%
r 1000000
14.3%
a 1000000
14.3%
n 1000000
14.3%
d 1000000
14.3%
1000000
14.3%
Y 333775
 
4.8%
Z 333608
 
4.8%
X 332617
 
4.8%
ValueCountFrequency (%)
B 30000
14.3%
r 30000
14.3%
a 30000
14.3%
n 30000
14.3%
d 30000
14.3%
30000
14.3%
Z 10045
 
4.8%
Y 10035
 
4.8%
X 9920
 
4.7%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 7000000
100.0%
ValueCountFrequency (%)
(unknown) 210000
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
B 1000000
14.3%
r 1000000
14.3%
a 1000000
14.3%
n 1000000
14.3%
d 1000000
14.3%
1000000
14.3%
Y 333775
 
4.8%
Z 333608
 
4.8%
X 332617
 
4.8%
ValueCountFrequency (%)
B 30000
14.3%
r 30000
14.3%
a 30000
14.3%
n 30000
14.3%
d 30000
14.3%
30000
14.3%
Z 10045
 
4.8%
Y 10035
 
4.8%
X 9920
 
4.7%

product_rating
Real number (ℝ)

Statistic | Full Dataset | Stratified Sample
Distinct | 41 | 41
Distinct (%) | < 0.1% | 0.1%
Missing | 0 | 0
Missing (%) | 0.0% | 0.0%
Infinite | 0 | 0
Infinite (%) | 0.0% | 0.0%
Mean | 2.9990096 | 3.000136667
Minimum | 1 | 1
Maximum | 5 | 5
Zeros | 0 | 0
Zeros (%) | 0.0% | 0.0%
Negative | 0 | 0
Negative (%) | 0.0% | 0.0%
Memory size | 7.6 MiB | 234.5 KiB

Quantile statistics

Statistic | Full Dataset | Stratified Sample
Minimum | 1 | 1
5-th percentile | 1.2 | 1.2
Q1 | 2 | 2
median | 3 | 3
Q3 | 4 | 4
95-th percentile | 4.8 | 4.8
Maximum | 5 | 5
Range | 4 | 4
Interquartile range (IQR) | 2 | 2

Descriptive statistics

Statistic | Full Dataset | Stratified Sample
Standard deviation | 1.154800603 | 1.159605021
Coefficient of variation (CV) | 0.3850606557 | 0.3865173989
Kurtosis | -1.196293362 | -1.202777711
Mean | 2.9990096 | 3.000136667
Median Absolute Deviation (MAD) | 1 | 1
Skewness | -0.0005343871929 | -0.005588273582
Sum | 2999009.6 | 90004.1
Variance | 1.333564433 | 1.344683804
Monotonicity | Not monotonic | Not monotonic
Histogram with fixed size bins (bins=41)
ValueCountFrequency (%)
2.9 25242
 
2.5%
3.4 25229
 
2.5%
2.6 25229
 
2.5%
1.3 25194
 
2.5%
3 25181
 
2.5%
4.7 25166
 
2.5%
4.3 25159
 
2.5%
4.1 25146
 
2.5%
1.6 25141
 
2.5%
4 25134
 
2.5%
Other values (31) 748179
74.8%
ValueCountFrequency (%)
4.6 811
 
2.7%
2 796
 
2.7%
1.4 783
 
2.6%
4.4 780
 
2.6%
3.4 780
 
2.6%
1.6 770
 
2.6%
3.3 768
 
2.6%
3.8 768
 
2.6%
4.2 767
 
2.6%
2.6 765
 
2.5%
Other values (31) 22212
74.0%
ValueCountFrequency (%)
1 12653
1.3%
1.1 24871
2.5%
1.2 25095
2.5%
1.3 25194
2.5%
1.4 24848
2.5%
ValueCountFrequency (%)
1 404
1.3%
1.1 760
2.5%
1.2 749
2.5%
1.3 755
2.5%
1.4 783
2.6%

product_review_count
Real number (ℝ)

Statistic | Full Dataset | Stratified Sample
Distinct | 1000 | 1000
Distinct (%) | 0.1% | 3.3%
Missing | 0 | 0
Missing (%) | 0.0% | 0.0%
Infinite | 0 | 0
Infinite (%) | 0.0% | 0.0%
Mean | 499.235198 | 500.4784667
Minimum | 0 | 0
Maximum | 999 | 999
Zeros | 987 | 40
Zeros (%) | 0.1% | 0.1%
Negative | 0 | 0
Negative (%) | 0.0% | 0.0%
Memory size | 7.6 MiB | 234.5 KiB

Quantile statistics

Statistic | Full Dataset | Stratified Sample
Minimum | 0 | 0
5-th percentile | 50 | 48.95
Q1 | 250 | 250
median | 499 | 502
Q3 | 749 | 753
95-th percentile | 949 | 951
Maximum | 999 | 999
Range | 999 | 999
Interquartile range (IQR) | 499 | 503

Descriptive statistics

Statistic | Full Dataset | Stratified Sample
Standard deviation | 288.4461496 | 289.9158218
Coefficient of variation (CV) | 0.5777760678 | 0.5792773138
Kurtosis | -1.19905271 | -1.209343923
Mean | 499.235198 | 500.4784667
Median Absolute Deviation (MAD) | 250 | 251
Skewness | 0.001190014496 | -0.005998671865
Sum | 499235198 | 15014354
Variance | 83201.18122 | 84051.18371
Monotonicity | Not monotonic | Not monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
974 1095
 
0.1%
56 1089
 
0.1%
769 1089
 
0.1%
725 1088
 
0.1%
683 1085
 
0.1%
229 1082
 
0.1%
501 1079
 
0.1%
937 1074
 
0.1%
384 1073
 
0.1%
497 1072
 
0.1%
Other values (990) 989174
98.9%
ValueCountFrequency (%)
907 51
 
0.2%
704 47
 
0.2%
90 47
 
0.2%
627 46
 
0.2%
884 44
 
0.1%
785 44
 
0.1%
818 44
 
0.1%
328 43
 
0.1%
769 43
 
0.1%
20 43
 
0.1%
Other values (990) 29548
98.5%
ValueCountFrequency (%)
0 987
0.1%
1 999
0.1%
2 1006
0.1%
3 1006
0.1%
4 1027
0.1%
ValueCountFrequency (%)
0 40
0.1%
1 33
0.1%
2 31
0.1%
3 23
0.1%
4 38
0.1%

product_stock
Real number (ℝ)

Statistic | Full Dataset | Stratified Sample
Distinct | 100 | 100
Distinct (%) | < 0.1% | 0.3%
Missing | 0 | 0
Missing (%) | 0.0% | 0.0%
Infinite | 0 | 0
Infinite (%) | 0.0% | 0.0%
Mean | 49.515129 | 49.56016667
Minimum | 0 | 0
Maximum | 99 | 99
Zeros | 10174 | 317
Zeros (%) | 1.0% | 1.1%
Negative | 0 | 0
Negative (%) | 0.0% | 0.0%
Memory size | 7.6 MiB | 234.5 KiB

Quantile statistics

Statistic | Full Dataset | Stratified Sample
Minimum | 0 | 0
5-th percentile | 5 | 4
Q1 | 25 | 24
median | 49 | 50
Q3 | 75 | 75
95-th percentile | 95 | 94
Maximum | 99 | 99
Range | 99 | 99
Interquartile range (IQR) | 50 | 51

Descriptive statistics

Statistic | Full Dataset | Stratified Sample
Standard deviation | 28.87664529 | 28.97239192
Coefficient of variation (CV) | 0.5831883279 | 0.5845902842
Kurtosis | -1.200520476 | -1.214128381
Mean | 49.515129 | 49.56016667
Median Absolute Deviation (MAD) | 25 | 25
Skewness | 0.0006383736941 | -0.00846355267
Sum | 49515129 | 1486805
Variance | 833.860643 | 839.3994933
Monotonicity | Not monotonic | Not monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
89 10261
 
1.0%
70 10245
 
1.0%
60 10187
 
1.0%
23 10175
 
1.0%
0 10174
 
1.0%
54 10171
 
1.0%
44 10148
 
1.0%
96 10147
 
1.0%
32 10138
 
1.0%
77 10136
 
1.0%
Other values (90) 898218
89.8%
ValueCountFrequency (%)
35 340
 
1.1%
18 330
 
1.1%
5 330
 
1.1%
82 328
 
1.1%
70 327
 
1.1%
50 326
 
1.1%
33 325
 
1.1%
66 324
 
1.1%
71 324
 
1.1%
3 323
 
1.1%
Other values (90) 26723
89.1%
ValueCountFrequency (%)
0 10174
1.0%
1 9857
1.0%
2 9895
1.0%
3 10030
1.0%
4 9924
1.0%
ValueCountFrequency (%)
0 317
1.1%
1 272
0.9%
2 299
1.0%
3 323
1.1%
4 313
1.0%

product_return_rate
Real number (ℝ)

Statistic | Full Dataset | Stratified Sample
Distinct | 51 | 51
Distinct (%) | < 0.1% | 0.2%
Missing | 0 | 0
Missing (%) | 0.0% | 0.0%
Infinite | 0 | 0
Infinite (%) | 0.0% | 0.0%
Mean | 0.25013741 | 0.2500103333
Minimum | 0 | 0
Maximum | 0.5 | 0.5
Zeros | 9960 | 299
Zeros (%) | 1.0% | 1.0%
Negative | 0 | 0
Negative (%) | 0.0% | 0.0%
Memory size | 7.6 MiB | 234.5 KiB

Quantile statistics

Statistic | Full Dataset | Stratified Sample
Minimum | 0 | 0
5-th percentile | 0.03 | 0.02
Q1 | 0.13 | 0.12
median | 0.25 | 0.25
Q3 | 0.38 | 0.38
95-th percentile | 0.48 | 0.48
Maximum | 0.5 | 0.5
Range | 0.5 | 0.5
Interquartile range (IQR) | 0.25 | 0.26

Descriptive statistics

Statistic | Full Dataset | Stratified Sample
Standard deviation | 0.1444084896 | 0.144895237
Coefficient of variation (CV) | 0.5773166421 | 0.5795569931
Kurtosis | -1.197824771 | -1.200929524
Mean | 0.25013741 | 0.2500103333
Median Absolute Deviation (MAD) | 0.13 | 0.13
Skewness | -0.0005165569762 | -0.006130410715
Sum | 250137.41 | 7500.31
Variance | 0.02085381187 | 0.02099462971
Monotonicity | Not monotonic | Not monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0.43 20287
 
2.0%
0.38 20282
 
2.0%
0.03 20242
 
2.0%
0.46 20215
 
2.0%
0.4 20209
 
2.0%
0.14 20164
 
2.0%
0.45 20148
 
2.0%
0.16 20140
 
2.0%
0.06 20135
 
2.0%
0.29 20118
 
2.0%
Other values (41) 798060
79.8%
ValueCountFrequency (%)
0.23 661
 
2.2%
0.28 654
 
2.2%
0.01 649
 
2.2%
0.08 639
 
2.1%
0.44 635
 
2.1%
0.33 633
 
2.1%
0.21 628
 
2.1%
0.42 626
 
2.1%
0.41 625
 
2.1%
0.03 620
 
2.1%
Other values (41) 23630
78.8%
ValueCountFrequency (%)
0 9960
1.0%
0.01 19921
2.0%
0.02 19994
2.0%
0.03 20242
2.0%
0.04 19825
2.0%
ValueCountFrequency (%)
0 299
1.0%
0.01 649
2.2%
0.02 618
2.1%
0.03 620
2.1%
0.04 594
2.0%

product_size
Text

Statistic | Full Dataset | Stratified Sample
Distinct | 3 | 3
Distinct (%) | < 0.1% | < 0.1%
Missing | 0 | 0
Missing (%) | 0.0% | 0.0%
Memory size | 7.6 MiB | 234.5 KiB

Length

Statistic | Full Dataset | Stratified Sample
Max length | 6 | 6
Median length | 5 | 5
Mean length | 5.333501 | 5.328933333
Min length | 5 | 5
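
The mean length and the total character count can be cross-checked against the value counts reported for this variable (Large 333964, Medium 333501, Small 332535 in the full dataset). A small sketch of that check; the variable names are illustrative:

```python
# Full-dataset counts per value, from the product_size frequency table.
counts = {"Large": 333_964, "Medium": 333_501, "Small": 332_535}

# Each value contributes its string length once per occurrence.
total_chars = sum(len(value) * n for value, n in counts.items())
n_rows = sum(counts.values())
mean_length = total_chars / n_rows

print(total_chars, n_rows, mean_length)
```

Only "Medium" has six characters, so the mean length exceeds 5 by exactly the share of "Medium" rows.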

Characters and Unicode

Statistic | Full Dataset | Stratified Sample
Total characters | 5333501 | 159868
Distinct characters | 12 | 12
Distinct categories | 1 | 1
Distinct scripts | 1 | 1
Distinct blocks | 1 | 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Statistic | Full Dataset | Stratified Sample
Unique | 0 | 0
Unique (%) | 0.0% | 0.0%

Sample

Row | Full Dataset | Stratified Sample
1st row | Small | Medium
2nd row | Medium | Small
3rd row | Medium | Small
4th row | Large | Large
5th row | Small | Small
ValueCountFrequency (%)
large 333964
33.4%
medium 333501
33.4%
small 332535
33.3%
ValueCountFrequency (%)
large 10120
33.7%
small 10012
33.4%
medium 9868
32.9%

Most occurring characters

ValueCountFrequency (%)
e 667465
12.5%
a 666499
12.5%
m 666036
12.5%
l 665070
12.5%
g 333964
6.3%
r 333964
6.3%
L 333964
6.3%
M 333501
6.3%
i 333501
6.3%
d 333501
6.3%
Other values (2) 666036
12.5%
ValueCountFrequency (%)
a 20132
12.6%
l 20024
12.5%
e 19988
12.5%
m 19880
12.4%
g 10120
6.3%
r 10120
6.3%
L 10120
6.3%
S 10012
6.3%
M 9868
6.2%
d 9868
6.2%
Other values (2) 19736
12.3%

Most occurring categories

ValueCountFrequency (%)
(unknown) 5333501
100.0%
ValueCountFrequency (%)
(unknown) 159868
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
e 667465
12.5%
a 666499
12.5%
m 666036
12.5%
l 665070
12.5%
g 333964
6.3%
r 333964
6.3%
L 333964
6.3%
M 333501
6.3%
i 333501
6.3%
d 333501
6.3%
Other values (2) 666036
12.5%
ValueCountFrequency (%)
a 20132
12.6%
l 20024
12.5%
e 19988
12.5%
m 19880
12.4%
g 10120
6.3%
r 10120
6.3%
L 10120
6.3%
S 10012
6.3%
M 9868
6.2%
d 9868
6.2%
Other values (2) 19736
12.3%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 5333501
100.0%
ValueCountFrequency (%)
(unknown) 159868
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
e 667465
12.5%
a 666499
12.5%
m 666036
12.5%
l 665070
12.5%
g 333964
6.3%
r 333964
6.3%
L 333964
6.3%
M 333501
6.3%
i 333501
6.3%
d 333501
6.3%
Other values (2) 666036
12.5%
ValueCountFrequency (%)
a 20132
12.6%
l 20024
12.5%
e 19988
12.5%
m 19880
12.4%
g 10120
6.3%
r 10120
6.3%
L 10120
6.3%
S 10012
6.3%
M 9868
6.2%
d 9868
6.2%
Other values (2) 19736
12.3%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 5333501
100.0%
ValueCountFrequency (%)
(unknown) 159868
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
e 667465
12.5%
a 666499
12.5%
m 666036
12.5%
l 665070
12.5%
g 333964
6.3%
r 333964
6.3%
L 333964
6.3%
M 333501
6.3%
i 333501
6.3%
d 333501
6.3%
Other values (2) 666036
12.5%
ValueCountFrequency (%)
a 20132
12.6%
l 20024
12.5%
e 19988
12.5%
m 19880
12.4%
g 10120
6.3%
r 10120
6.3%
L 10120
6.3%
S 10012
6.3%
M 9868
6.2%
d 9868
6.2%
Other values (2) 19736
12.3%

product_weight
Real number (ℝ)

Statistic | Full Dataset | Stratified Sample
Distinct | 991 | 991
Distinct (%) | 0.1% | 3.3%
Missing | 0 | 0
Missing (%) | 0.0% | 0.0%
Infinite | 0 | 0
Infinite (%) | 0.0% | 0.0%
Mean | 5.05437238 | 5.073211
Minimum | 0.1 | 0.1
Maximum | 10 | 10
Zeros | 0 | 0
Zeros (%) | 0.0% | 0.0%
Negative | 0 | 0
Negative (%) | 0.0% | 0.0%
Memory size | 7.6 MiB | 234.5 KiB

Quantile statistics

Statistic | Full Dataset | Stratified Sample
Minimum | 0.1 | 0.1
5-th percentile | 0.6 | 0.61
Q1 | 2.58 | 2.6
median | 5.06 | 5.09
Q3 | 7.53 | 7.56
95-th percentile | 9.5 | 9.49
Maximum | 10 | 10
Range | 9.9 | 9.9
Interquartile range (IQR) | 4.95 | 4.96

Descriptive statistics

Statistic | Full Dataset | Stratified Sample
Standard deviation | 2.857848487 | 2.855194911
Coefficient of variation (CV) | 0.56542104 | 0.5627983757
Kurtosis | -1.200012392 | -1.203524603
Mean | 5.05437238 | 5.073211
Median Absolute Deviation (MAD) | 2.47 | 2.48
Skewness | -0.001975515497 | -0.01206535638
Sum | 5054372.38 | 152196.33
Variance | 8.167297977 | 8.152137977
Monotonicity | Not monotonic | Not monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2.51 1094
 
0.1%
7.79 1092
 
0.1%
3.96 1089
 
0.1%
3.55 1089
 
0.1%
1.61 1088
 
0.1%
5.24 1088
 
0.1%
1.74 1087
 
0.1%
4.66 1085
 
0.1%
3.04 1082
 
0.1%
1.22 1081
 
0.1%
Other values (981) 989125
98.9%
ValueCountFrequency (%)
4.7 51
 
0.2%
9.34 48
 
0.2%
8.52 47
 
0.2%
2.3 46
 
0.2%
7.86 46
 
0.2%
7.57 46
 
0.2%
9.31 46
 
0.2%
3.34 45
 
0.1%
6.89 45
 
0.1%
1.61 44
 
0.1%
Other values (981) 29536
98.5%
ValueCountFrequency (%)
0.1 506
0.1%
0.11 1031
0.1%
0.12 996
0.1%
0.13 1001
0.1%
0.14 1007
0.1%
ValueCountFrequency (%)
0.1 11
 
< 0.1%
0.11 33
0.1%
0.12 38
0.1%
0.13 37
0.1%
0.14 28
0.1%

product_color
Text

Statistic | Full Dataset | Stratified Sample
Distinct | 5 | 5
Distinct (%) | < 0.1% | < 0.1%
Missing | 0 | 0
Missing (%) | 0.0% | 0.0%
Memory size | 7.6 MiB | 234.5 KiB

Length

Statistic | Full Dataset | Stratified Sample
Max length | 5 | 5
Median length | 5 | 5
Mean length | 4.399397 | 4.3939
Min length | 3 | 3

Characters and Unicode

Statistic | Full Dataset | Stratified Sample
Total characters | 4399397 | 131817
Distinct characters | 16 | 16
Distinct categories | 1 | 1
Distinct scripts | 1 | 1
Distinct blocks | 1 | 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Statistic | Full Dataset | Stratified Sample
Unique | 0 | 0
Unique (%) | 0.0% | 0.0%

Sample

Row | Full Dataset | Stratified Sample
1st row | Red | Green
2nd row | Blue | Red
3rd row | Green | White
4th row | Blue | Blue
5th row | Red | Green
ValueCountFrequency (%)
blue 200671
20.1%
green 200202
20.0%
red 199966
20.0%
black 199704
20.0%
white 199457
19.9%
ValueCountFrequency (%)
red 6070
20.2%
blue 6043
20.1%
black 6007
20.0%
green 5974
19.9%
white 5906
19.7%

Most occurring characters

ValueCountFrequency (%)
e 1000498
22.7%
B 400375
 
9.1%
l 400375
 
9.1%
u 200671
 
4.6%
G 200202
 
4.6%
r 200202
 
4.6%
n 200202
 
4.6%
R 199966
 
4.5%
d 199966
 
4.5%
a 199704
 
4.5%
Other values (6) 1197236
27.2%
ValueCountFrequency (%)
e 29967
22.7%
B 12050
 
9.1%
l 12050
 
9.1%
R 6070
 
4.6%
d 6070
 
4.6%
u 6043
 
4.6%
a 6007
 
4.6%
c 6007
 
4.6%
k 6007
 
4.6%
G 5974
 
4.5%
Other values (6) 35572
27.0%

Most occurring categories

ValueCountFrequency (%)
(unknown) 4399397
100.0%
ValueCountFrequency (%)
(unknown) 131817
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
e 1000498
22.7%
B 400375
 
9.1%
l 400375
 
9.1%
u 200671
 
4.6%
G 200202
 
4.6%
r 200202
 
4.6%
n 200202
 
4.6%
R 199966
 
4.5%
d 199966
 
4.5%
a 199704
 
4.5%
Other values (6) 1197236
27.2%
ValueCountFrequency (%)
e 29967
22.7%
B 12050
 
9.1%
l 12050
 
9.1%
R 6070
 
4.6%
d 6070
 
4.6%
u 6043
 
4.6%
a 6007
 
4.6%
c 6007
 
4.6%
k 6007
 
4.6%
G 5974
 
4.5%
Other values (6) 35572
27.0%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 4399397
100.0%
ValueCountFrequency (%)
(unknown) 131817
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
e 1000498
22.7%
B 400375
 
9.1%
l 400375
 
9.1%
u 200671
 
4.6%
G 200202
 
4.6%
r 200202
 
4.6%
n 200202
 
4.6%
R 199966
 
4.5%
d 199966
 
4.5%
a 199704
 
4.5%
Other values (6) 1197236
27.2%
ValueCountFrequency (%)
e 29967
22.7%
B 12050
 
9.1%
l 12050
 
9.1%
R 6070
 
4.6%
d 6070
 
4.6%
u 6043
 
4.6%
a 6007
 
4.6%
c 6007
 
4.6%
k 6007
 
4.6%
G 5974
 
4.5%
Other values (6) 35572
27.0%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 4399397
100.0%
ValueCountFrequency (%)
(unknown) 131817
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
e 1000498
22.7%
B 400375
 
9.1%
l 400375
 
9.1%
u 200671
 
4.6%
G 200202
 
4.6%
r 200202
 
4.6%
n 200202
 
4.6%
R 199966
 
4.5%
d 199966
 
4.5%
a 199704
 
4.5%
Other values (6) 1197236
27.2%
ValueCountFrequency (%)
e 29967
22.7%
B 12050
 
9.1%
l 12050
 
9.1%
R 6070
 
4.6%
d 6070
 
4.6%
u 6043
 
4.6%
a 6007
 
4.6%
c 6007
 
4.6%
k 6007
 
4.6%
G 5974
 
4.5%
Other values (6) 35572
27.0%

product_material
Text

Statistic | Full Dataset | Stratified Sample
Distinct | 4 | 4
Distinct (%) | < 0.1% | < 0.1%
Missing | 0 | 0
Missing (%) | 0.0% | 0.0%
Memory size | 7.6 MiB | 234.5 KiB

Length

Statistic | Full Dataset | Stratified Sample
Max length | 7 | 7
Median length | 5 | 5
Mean length | 5.25087 | 5.262533333
Min length | 4 | 4

Characters and Unicode

Statistic | Full Dataset | Stratified Sample
Total characters | 5250870 | 157876
Distinct characters | 13 | 13
Distinct categories | 1 | 1
Distinct scripts | 1 | 1
Distinct blocks | 1 | 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Statistic | Full Dataset | Stratified Sample
Unique | 0 | 0
Unique (%) | 0.0% | 0.0%

Sample

Row | Full Dataset | Stratified Sample
1st row | Metal | Wood
2nd row | Metal | Wood
3rd row | Plastic | Plastic
4th row | Wood | Glass
5th row | Metal | Plastic
ValueCountFrequency (%)
plastic 250483
25.0%
wood 250096
25.0%
metal 249896
25.0%
glass 249525
25.0%
ValueCountFrequency (%)
plastic 7669
25.6%
glass 7513
25.0%
wood 7462
24.9%
metal 7356
24.5%

Most occurring characters

ValueCountFrequency (%)
l 749904
14.3%
a 749904
14.3%
s 749533
14.3%
t 500379
9.5%
o 500192
9.5%
P 250483
 
4.8%
i 250483
 
4.8%
c 250483
 
4.8%
W 250096
 
4.8%
d 250096
 
4.8%
Other values (3) 749317
14.3%
ValueCountFrequency (%)
s 22695
14.4%
a 22538
14.3%
l 22538
14.3%
t 15025
9.5%
o 14924
9.5%
P 7669
 
4.9%
i 7669
 
4.9%
c 7669
 
4.9%
G 7513
 
4.8%
W 7462
 
4.7%
Other values (3) 22174
14.0%

Most occurring categories

ValueCountFrequency (%)
(unknown) 5250870
100.0%
ValueCountFrequency (%)
(unknown) 157876
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
l 749904
14.3%
a 749904
14.3%
s 749533
14.3%
t 500379
9.5%
o 500192
9.5%
P 250483
 
4.8%
i 250483
 
4.8%
c 250483
 
4.8%
W 250096
 
4.8%
d 250096
 
4.8%
Other values (3) 749317
14.3%
ValueCountFrequency (%)
s 22695
14.4%
a 22538
14.3%
l 22538
14.3%
t 15025
9.5%
o 14924
9.5%
P 7669
 
4.9%
i 7669
 
4.9%
c 7669
 
4.9%
G 7513
 
4.8%
W 7462
 
4.7%
Other values (3) 22174
14.0%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 5250870
100.0%
ValueCountFrequency (%)
(unknown) 157876
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
l 749904
14.3%
a 749904
14.3%
s 749533
14.3%
t 500379
9.5%
o 500192
9.5%
P 250483
 
4.8%
i 250483
 
4.8%
c 250483
 
4.8%
W 250096
 
4.8%
d 250096
 
4.8%
Other values (3) 749317
14.3%
ValueCountFrequency (%)
s 22695
14.4%
a 22538
14.3%
l 22538
14.3%
t 15025
9.5%
o 14924
9.5%
P 7669
 
4.9%
i 7669
 
4.9%
c 7669
 
4.9%
G 7513
 
4.8%
W 7462
 
4.7%
Other values (3) 22174
14.0%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 5250870
100.0%
ValueCountFrequency (%)
(unknown) 157876
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
l 749904
14.3%
a 749904
14.3%
s 749533
14.3%
t 500379
9.5%
o 500192
9.5%
P 250483
 
4.8%
i 250483
 
4.8%
c 250483
 
4.8%
W 250096
 
4.8%
d 250096
 
4.8%
Other values (3) 749317
14.3%
ValueCountFrequency (%)
s 22695
14.4%
a 22538
14.3%
l 22538
14.3%
t 15025
9.5%
o 14924
9.5%
P 7669
 
4.9%
i 7669
 
4.9%
c 7669
 
4.9%
G 7513
 
4.8%
W 7462
 
4.7%
Other values (3) 22174
14.0%

product_manufacture_date
Text

Statistic | Full Dataset | Stratified Sample
Distinct | 992037 | 29992
Distinct (%) | 99.2% | > 99.9%
Missing | 0 | 0
Missing (%) | 0.0% | 0.0%
Memory size | 7.6 MiB | 234.5 KiB

Length

Statistic | Full Dataset | Stratified Sample
Max length | 19 | 19
Median length | 19 | 19
Mean length | 19 | 19
Min length | 19 | 19

Characters and Unicode

Statistic | Full Dataset | Stratified Sample
Total characters | 19000000 | 570000
Distinct characters | 13 | 13
Distinct categories | 1 | 1
Distinct scripts | 1 | 1
Distinct blocks | 1 | 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
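As a rough illustration of the character properties mentioned above, the sketch below tallies the Unicode general category of each character in one of the sample values, using Python's standard `unicodedata` module. This is an assumption about how such an analysis could be reproduced by hand, not the profiler's own implementation; the helper name `category_counts` is made up, and the standard library exposes general categories but not scripts or blocks.

```python
import unicodedata
from collections import Counter


def category_counts(text):
    """Count Unicode general categories (e.g. 'Nd' for decimal digits) in text."""
    return Counter(unicodedata.category(ch) for ch in text)


# First Full Dataset sample row of product_manufacture_date.
sample = "2019-08-04 01:47:01"
counts = category_counts(sample)

# Digits fall under 'Nd', the hyphen under 'Pd', the colon under 'Po',
# and the space under 'Zs'.
print(len(sample), dict(counts))
```

A timestamp in this format can only contain the ten digits plus the hyphen, colon and space, which agrees with the 13 distinct characters reported for the column.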

Unique

             Full Dataset  Stratified Sample
Unique       984126        29984
Unique (%)   98.4%         99.9%
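The difference between the Distinct and Unique figures is easier to see on a toy column. The sketch below assumes that "distinct" counts the different values present and "unique" counts the values that occur exactly once, which is consistent with Unique never exceeding Distinct in this report. The helper name `distinct_and_unique` and the toy rows are made up for illustration.

```python
from collections import Counter


def distinct_and_unique(values):
    """Return (number of different values, number of values seen exactly once)."""
    counts = Counter(values)
    n_distinct = len(counts)
    n_unique = sum(1 for c in counts.values() if c == 1)
    return n_distinct, n_unique


# Hypothetical rows: one timestamp appears twice, two appear once.
toy = [
    "2019-01-01 00:00:00",
    "2019-01-01 00:00:00",
    "2019-01-02 08:30:00",
    "2019-01-03 12:00:00",
]
n_distinct, n_unique = distinct_and_unique(toy)
print(n_distinct, n_unique)
print(f"{n_distinct / len(toy):.1%}", f"{n_unique / len(toy):.1%}")
```

On the toy data a repeated value still counts once towards Distinct but not at all towards Unique, which is why Unique is the smaller of the two numbers.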

Sample

          Full Dataset         Stratified Sample
1st row   2019-08-04 01:47:01  2019-03-16 10:53:28
2nd row   2019-10-23 19:59:17  2018-09-16 06:18:11
3rd row   2018-05-12 08:00:29  2018-12-23 10:31:52
4th row   2019-11-15 16:17:29  2019-10-23 08:43:30
5th row   2019-08-27 02:58:19  2019-01-02 12:03:08
Full Dataset

Value                 Count    Frequency (%)
2018-04-10            1514     0.1%
2019-03-19            1490     0.1%
2018-02-26            1471     0.1%
2018-06-18            1467     0.1%
2018-09-24            1462     0.1%
2019-01-25            1457     0.1%
2019-01-30            1456     0.1%
2018-04-28            1454     0.1%
2019-07-04            1453     0.1%
2019-01-09            1453     0.1%
Other values (87119)  1985323  99.3%

Stratified Sample

Value                 Count    Frequency (%)
2019-09-18            61       0.1%
2018-02-06            59       0.1%
2018-12-13            59       0.1%
2019-02-03            58       0.1%
2019-01-20            55       0.1%
2019-02-14            55       0.1%
2018-10-01            55       0.1%
2018-04-10            54       0.1%
2019-02-01            54       0.1%
2019-12-28            54       0.1%
Other values (25987)  59436    99.1%
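The counts in the two word tables above add up to 2,000,000 and 60,000, twice the number of rows in each dataset. That is what one would expect if every 19-character value were split on whitespace into a date token and a time token before counting. The sketch below reproduces that behaviour under this assumption; `word_counts` is a hypothetical helper rather than the profiler's code, and the third example row is invented to show a repeated date.

```python
from collections import Counter


def word_counts(values):
    """Count whitespace-separated tokens across all values."""
    counter = Counter()
    for value in values:
        counter.update(value.split())
    return counter


rows = [
    "2019-08-04 01:47:01",
    "2019-10-23 19:59:17",
    "2019-08-04 16:17:29",  # invented row that repeats the first date
]
counts = word_counts(rows)
print(counts["2019-08-04"], sum(counts.values()))

# Consistency check against the Full Dataset table: ten listed counts plus
# the "Other values" row should equal two tokens per row.
top_ten_full = [1514, 1490, 1471, 1467, 1462, 1457, 1456, 1454, 1453, 1453]
total_full = sum(top_ten_full) + 1985323
print(total_full)
```

Because time-of-day tokens almost never repeat, only date tokens are frequent enough to reach the top of the table.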

Most occurring characters

ValueCountFrequency (%)
0 3297851
17.4%
1 2942646
15.5%
2 2411227
12.7%
- 2000000
10.5%
: 2000000
10.5%
1000000
 
5.3%
8 967248
 
5.1%
9 961589
 
5.1%
3 891404
 
4.7%
5 799916
 
4.2%
Other values (3) 1728119
9.1%
ValueCountFrequency (%)
0 99034
17.4%
1 88344
15.5%
2 72210
12.7%
- 60000
10.5%
: 60000
10.5%
30000
 
5.3%
8 29035
 
5.1%
9 28774
 
5.0%
3 27023
 
4.7%
4 24155
 
4.2%
Other values (3) 51425
9.0%

Most occurring categories

ValueCountFrequency (%)
(unknown) 19000000
100.0%
ValueCountFrequency (%)
(unknown) 570000
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
0 3297851
17.4%
1 2942646
15.5%
2 2411227
12.7%
- 2000000
10.5%
: 2000000
10.5%
1000000
 
5.3%
8 967248
 
5.1%
9 961589
 
5.1%
3 891404
 
4.7%
5 799916
 
4.2%
Other values (3) 1728119
9.1%
ValueCountFrequency (%)
0 99034
17.4%
1 88344
15.5%
2 72210
12.7%
- 60000
10.5%
: 60000
10.5%
30000
 
5.3%
8 29035
 
5.1%
9 28774
 
5.0%
3 27023
 
4.7%
4 24155
 
4.2%
Other values (3) 51425
9.0%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 19000000
100.0%
ValueCountFrequency (%)
(unknown) 570000
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
0 3297851
17.4%
1 2942646
15.5%
2 2411227
12.7%
- 2000000
10.5%
: 2000000
10.5%
1000000
 
5.3%
8 967248
 
5.1%
9 961589
 
5.1%
3 891404
 
4.7%
5 799916
 
4.2%
Other values (3) 1728119
9.1%
ValueCountFrequency (%)
0 99034
17.4%
1 88344
15.5%
2 72210
12.7%
- 60000
10.5%
: 60000
10.5%
30000
 
5.3%
8 29035
 
5.1%
9 28774
 
5.0%
3 27023
 
4.7%
4 24155
 
4.2%
Other values (3) 51425
9.0%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 19000000
100.0%
ValueCountFrequency (%)
(unknown) 570000
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
0 3297851
17.4%
1 2942646
15.5%
2 2411227
12.7%
- 2000000
10.5%
: 2000000
10.5%
1000000
 
5.3%
8 967248
 
5.1%
9 961589
 
5.1%
3 891404
 
4.7%
5 799916
 
4.2%
Other values (3) 1728119
9.1%
ValueCountFrequency (%)
0 99034
17.4%
1 88344
15.5%
2 72210
12.7%
- 60000
10.5%
: 60000
10.5%
30000
 
5.3%
8 29035
 
5.1%
9 28774
 
5.0%
3 27023
 
4.7%
4 24155
 
4.2%
Other values (3) 51425
9.0%

product_expiry_date
['Text', 'Text']

                Full Dataset  Stratified Sample
Distinct        992042        29990
Distinct (%)    99.2%         > 99.9%
Missing         0             0
Missing (%)     0.0%          0.0%
Memory size     7.6 MiB       234.5 KiB

Length

                     Full Dataset  Stratified Sample
Max length           19            19
Median length        19            19
Mean length          19            19
Min length           19            19

Characters and Unicode

                     Full Dataset  Stratified Sample
Total characters     19000000      570000
Distinct characters  13            13
Distinct categories  1             1
Distinct scripts     1             1
Distinct blocks      1             1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

             Full Dataset  Stratified Sample
Unique       984121        29980
Unique (%)   98.4%         99.9%

Sample

          Full Dataset         Stratified Sample
1st row   2022-05-28 14:54:02  2022-01-03 18:34:16
2nd row   2022-12-19 08:04:41  2023-12-15 07:55:54
3rd row   2023-02-01 12:15:07  2023-01-10 02:39:45
4th row   2023-02-05 11:46:57  2022-08-09 14:28:49
5th row   2023-10-05 08:13:07  2022-11-19 06:40:08
ValueCountFrequency (%)
2022-12-22 1476
 
0.1%
2022-06-08 1475
 
0.1%
2022-01-28 1473
 
0.1%
2023-03-13 1472
 
0.1%
2022-10-06 1468
 
0.1%
2022-06-27 1468
 
0.1%
2023-07-08 1457
 
0.1%
2023-07-23 1457
 
0.1%
2023-01-11 1452
 
0.1%
2022-04-17 1450
 
0.1%
Other values (87119) 1985352
99.3%
ValueCountFrequency (%)
2022-04-24 68
 
0.1%
2023-11-08 64
 
0.1%
2023-10-01 63
 
0.1%
2022-07-14 61
 
0.1%
2022-05-05 57
 
0.1%
2023-06-01 57
 
0.1%
2022-08-22 56
 
0.1%
2022-05-27 56
 
0.1%
2023-12-09 56
 
0.1%
2023-04-30 56
 
0.1%
Other values (26089) 59406
99.0%

Most occurring characters

ValueCountFrequency (%)
2 3911569
20.6%
0 3298912
17.4%
- 2000000
10.5%
: 2000000
10.5%
1 1939371
10.2%
3 1392537
 
7.3%
1000000
 
5.3%
5 800253
 
4.2%
4 798405
 
4.2%
8 467536
 
2.5%
Other values (3) 1391417
 
7.3%
ValueCountFrequency (%)
2 117113
20.5%
0 98988
17.4%
- 60000
10.5%
: 60000
10.5%
1 58442
10.3%
3 41845
 
7.3%
30000
 
5.3%
5 24111
 
4.2%
4 23724
 
4.2%
6 14101
 
2.5%
Other values (3) 41676
 
7.3%

Most occurring categories

ValueCountFrequency (%)
(unknown) 19000000
100.0%
ValueCountFrequency (%)
(unknown) 570000
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
2 3911569
20.6%
0 3298912
17.4%
- 2000000
10.5%
: 2000000
10.5%
1 1939371
10.2%
3 1392537
 
7.3%
1000000
 
5.3%
5 800253
 
4.2%
4 798405
 
4.2%
8 467536
 
2.5%
Other values (3) 1391417
 
7.3%
ValueCountFrequency (%)
2 117113
20.5%
0 98988
17.4%
- 60000
10.5%
: 60000
10.5%
1 58442
10.3%
3 41845
 
7.3%
30000
 
5.3%
5 24111
 
4.2%
4 23724
 
4.2%
6 14101
 
2.5%
Other values (3) 41676
 
7.3%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 19000000
100.0%
ValueCountFrequency (%)
(unknown) 570000
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
2 3911569
20.6%
0 3298912
17.4%
- 2000000
10.5%
: 2000000
10.5%
1 1939371
10.2%
3 1392537
 
7.3%
1000000
 
5.3%
5 800253
 
4.2%
4 798405
 
4.2%
8 467536
 
2.5%
Other values (3) 1391417
 
7.3%
ValueCountFrequency (%)
2 117113
20.5%
0 98988
17.4%
- 60000
10.5%
: 60000
10.5%
1 58442
10.3%
3 41845
 
7.3%
30000
 
5.3%
5 24111
 
4.2%
4 23724
 
4.2%
6 14101
 
2.5%
Other values (3) 41676
 
7.3%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 19000000
100.0%
ValueCountFrequency (%)
(unknown) 570000
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
2 3911569
20.6%
0 3298912
17.4%
- 2000000
10.5%
: 2000000
10.5%
1 1939371
10.2%
3 1392537
 
7.3%
1000000
 
5.3%
5 800253
 
4.2%
4 798405
 
4.2%
8 467536
 
2.5%
Other values (3) 1391417
 
7.3%
ValueCountFrequency (%)
2 117113
20.5%
0 98988
17.4%
- 60000
10.5%
: 60000
10.5%
1 58442
10.3%
3 41845
 
7.3%
30000
 
5.3%
5 24111
 
4.2%
4 23724
 
4.2%
6 14101
 
2.5%
Other values (3) 41676
 
7.3%

product_shelf_life
Real number (ℝ)

                Full Dataset  Stratified Sample
Distinct        365           365
Distinct (%)    < 0.1%        1.2%
Missing         0             0
Missing (%)     0.0%          0.0%
Infinite        0             0
Infinite (%)    0.0%          0.0%
Mean            181.876207    181.6765667
Minimum         0             0
Maximum         364           364
Zeros           2713          93
Zeros (%)       0.3%          0.3%
Negative        0             0
Negative (%)    0.0%          0.0%
Memory size     7.6 MiB       234.5 KiB

Quantile statistics

                           Full Dataset  Stratified Sample
Minimum                    0             0
5-th percentile            18            18
Q1                         91            91
median                     182           182
Q3                         273           273
95-th percentile           346           346
Maximum                    364           364
Range                      364           364
Interquartile range (IQR)  182           182
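Two rows of the quantile table are derived from the others: Range is Maximum minus Minimum, and the interquartile range is Q3 minus Q1. The sketch below checks this on an idealised stand-in for the column in which every shelf life from 0 to 364 occurs exactly once. That uniform data is an assumption for illustration (the real column is only approximately uniform), and the inclusive quantile method of Python's `statistics` module is not necessarily the interpolation the profiler uses.

```python
import statistics

# Idealised column: each integer shelf life 0..364 exactly once (an assumption).
values = list(range(365))

# Quartiles with the "inclusive" method, which treats the data as the
# whole population and interpolates linearly between data points.
q1, median, q3 = statistics.quantiles(values, n=4, method="inclusive")

iqr = q3 - q1                            # interquartile range
value_range = max(values) - min(values)  # range

print(q1, median, q3, iqr, value_range)
```

On this idealised data the quartiles, IQR and range agree with the values the report shows for product_shelf_life.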

Descriptive statistics

                                 Full Dataset     Stratified Sample
Standard deviation               105.2288552      105.2236841
Coefficient of variation (CV)    0.5785740585     0.5791813772
Kurtosis                         -1.198082782     -1.198336941
Mean                             181.876207       181.6765667
Median Absolute Deviation (MAD)  91               91
Skewness                         0.0006229204449  3.395146203 × 10^-6
Sum                              181876207        5450297
Variance                         11073.11197      11072.02369
Monotonicity                     Not monotonic    Not monotonic
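Several of the descriptive statistics can be cross-checked from one another. The sketch below recomputes the coefficient of variation (standard deviation divided by mean) and the variance (standard deviation squared) from the Full Dataset figures, and computes a median absolute deviation on an idealised column holding each value 0 to 364 once. The helper names and the idealised data are assumptions for illustration, not the profiler's implementation.

```python
import statistics


def coefficient_of_variation(std, mean):
    """CV is the standard deviation expressed as a fraction of the mean."""
    return std / mean


def median_absolute_deviation(values):
    """Median of the absolute deviations from the median."""
    values = list(values)
    med = statistics.median(values)
    return statistics.median(abs(v - med) for v in values)


# Figures copied from the Full Dataset column of the table.
cv_full = coefficient_of_variation(105.2288552, 181.876207)
variance_full = 105.2288552 ** 2

# Idealised stand-in for the column (an assumption): 0..364, once each.
mad_ideal = median_absolute_deviation(range(365))

print(round(cv_full, 6), round(variance_full, 2), mad_ideal)
```

The recomputed CV and variance match the reported rows up to rounding, which is a quick way to confirm that the fused columns of the table were split correctly.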
Histogram with fixed size bins (bins=50)
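A histogram with fixed size bins divides the interval between the minimum and maximum into equally wide bins and counts the values in each. The sketch below is a minimal pure-Python version with 50 bins, as in the caption; the function name `fixed_width_histogram` and the idealised input are assumptions for illustration, and the report itself is drawn with Matplotlib rather than with this code.

```python
def fixed_width_histogram(values, bins=50):
    """Return (counts, edges) for equally wide bins spanning min..max."""
    lo, hi = min(values), max(values)
    width = (hi - lo) / bins
    counts = [0] * bins
    for v in values:
        if width == 0:
            idx = 0  # all values identical: everything lands in the first bin
        else:
            # The maximum falls on the last edge, so clamp it into the last bin.
            idx = min(int((v - lo) / width), bins - 1)
        counts[idx] += 1
    edges = [lo + i * width for i in range(bins + 1)]
    return counts, edges


# Idealised stand-in for product_shelf_life (an assumption): 0..364, once each.
counts, edges = fixed_width_histogram(list(range(365)), bins=50)
print(len(counts), sum(counts), edges[0], edges[1])
```

With a range of 364 and 50 bins each bin is 7.28 units wide, so on integer data neighbouring bins hold either 7 or 8 distinct values; small steps of that kind in the plot come from the binning, not from the data.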
ValueCountFrequency (%)
87 2893
 
0.3%
272 2874
 
0.3%
70 2870
 
0.3%
250 2870
 
0.3%
210 2862
 
0.3%
224 2859
 
0.3%
238 2857
 
0.3%
33 2848
 
0.3%
297 2847
 
0.3%
171 2845
 
0.3%
Other values (355) 971375
97.1%
ValueCountFrequency (%)
95 111
 
0.4%
196 102
 
0.3%
282 100
 
0.3%
142 100
 
0.3%
263 100
 
0.3%
304 100
 
0.3%
169 99
 
0.3%
284 99
 
0.3%
74 98
 
0.3%
92 98
 
0.3%
Other values (355) 28993
96.6%
ValueCountFrequency (%)
0 2713
0.3%
1 2788
0.3%
2 2776
0.3%
3 2725
0.3%
4 2788
0.3%
ValueCountFrequency (%)
0 93
0.3%
1 88
0.3%
2 88
0.3%
3 80
0.3%
4 89
0.3%
ValueCountFrequency (%)
0 93
< 0.1%
1 88
< 0.1%
2 88
< 0.1%
3 80
< 0.1%
4 89
< 0.1%
ValueCountFrequency (%)
0 2713
9.0%
1 2788
9.3%
2 2776
9.3%
3 2725
9.1%
4 2788
9.3%

promotion_id
Real number (ℝ)

                Full Dataset  Stratified Sample
Distinct        999           999
Distinct (%)    0.1%          3.3%
Missing         0             0
Missing (%)     0.0%          0.0%
Infinite        0             0
Infinite (%)    0.0%          0.0%
Mean            499.920037    500.8484333
Minimum         1             1
Maximum         999           999
Zeros           0             0
Zeros (%)       0.0%          0.0%
Negative        0             0
Negative (%)    0.0%          0.0%
Memory size     7.6 MiB       234.5 KiB

Quantile statistics

                           Full Dataset  Stratified Sample
Minimum                    1             1
5-th percentile            50            50
Q1                         250           253
median                     500           503.5
Q3                         750           748
95-th percentile           949           949
Maximum                    999           999
Range                      998           998
Interquartile range (IQR)  500           495

Descriptive statistics

                                 Full Dataset      Stratified Sample
Standard deviation               288.4530565       287.71409
Coefficient of variation (CV)    0.57699839        0.5744534091
Kurtosis                         -1.200677574      -1.19046795
Mean                             499.920037        500.8484333
Median Absolute Deviation (MAD)  250               247.5
Skewness                         -0.0008935044332  -0.004262817372
Sum                              499920037         15025453
Variance                         83205.16579       82779.39757
Monotonicity                     Not monotonic     Not monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
52 1092
 
0.1%
94 1082
 
0.1%
374 1079
 
0.1%
117 1077
 
0.1%
29 1075
 
0.1%
603 1075
 
0.1%
512 1073
 
0.1%
949 1073
 
0.1%
885 1073
 
0.1%
51 1070
 
0.1%
Other values (989) 989231
98.9%
ValueCountFrequency (%)
611 50
 
0.2%
305 47
 
0.2%
339 46
 
0.2%
949 45
 
0.1%
419 45
 
0.1%
240 45
 
0.1%
11 44
 
0.1%
525 44
 
0.1%
211 44
 
0.1%
203 44
 
0.1%
Other values (989) 29546
98.5%
ValueCountFrequency (%)
1 1033
0.1%
2 995
0.1%
3 1036
0.1%
4 1024
0.1%
5 992
0.1%
ValueCountFrequency (%)
1 34
0.1%
2 31
0.1%
3 44
0.1%
4 34
0.1%
5 17
 
0.1%
ValueCountFrequency (%)
1 34
< 0.1%
2 31
< 0.1%
3 44
< 0.1%
4 34
< 0.1%
5 17
 
< 0.1%
ValueCountFrequency (%)
1 1033
3.4%
2 995
3.3%
3 1036
3.5%
4 1024
3.4%
5 992
3.3%

promotion_type
['Text', 'Text']

                Full Dataset  Stratified Sample
Distinct        3             3
Distinct (%)    < 0.1%        < 0.1%
Missing         0             0
Missing (%)     0.0%          0.0%
Memory size     7.6 MiB       234.5 KiB

Length

                     Full Dataset  Stratified Sample
Max length           20            20
Median length        10            10
Mean length          12.334064     12.35276667
Min length           7             7

Characters and Unicode

                     Full Dataset  Stratified Sample
Total characters     12334064      370583
Distinct characters  20            20
Distinct categories  1             1
Distinct scripts     1             1
Distinct blocks      1             1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

             Full Dataset  Stratified Sample
Unique       0             0
Unique (%)   0.0%          0.0%

Sample

          Full Dataset          Stratified Sample
1st row   20% Off               Buy One Get One Free
2nd row   Flash Sale            20% Off
3rd row   Flash Sale            20% Off
4th row   Buy One Get One Free  Flash Sale
5th row   Flash Sale            20% Off
Full Dataset

Value   Count   Frequency (%)
one     667040  22.2%
20      333712  11.1%
off     333712  11.1%
buy     333520  11.1%
get     333520  11.1%
free    333520  11.1%
flash   332768  11.1%
sale    332768  11.1%

Stratified Sample

Value   Count   Frequency (%)
one     20110   22.3%
buy     10055   11.2%
get     10055   11.2%
free    10055   11.2%
20      9989    11.1%
off     9989    11.1%
flash   9956    11.0%
sale    9956    11.0%
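The word counts above can be tied back to the length statistics. Assuming the column holds only the three labels seen in the sample rows, and reading the number of rows per label off the Full Dataset word counts (for example, "flash" and "sale" both occur 332768 times), the total character count and the mean length follow by simple arithmetic. The per-label row counts are inferred rather than reported directly, so treat them as an assumption.

```python
# Rows per label in the Full Dataset, inferred from the word counts (an assumption).
label_counts = {
    "20% Off": 333712,
    "Flash Sale": 332768,
    "Buy One Get One Free": 333520,
}

n_rows = sum(label_counts.values())
total_characters = sum(len(label) * count for label, count in label_counts.items())
mean_length = total_characters / n_rows

print(n_rows, total_characters, mean_length)
```

The label lengths of 7, 10 and 20 characters are also the reported minimum, median and maximum lengths, and "one" appears twice per "Buy One Get One Free" row, which accounts for its count of 667040.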

Most occurring characters

ValueCountFrequency (%)
2000560
16.2%
e 2000368
16.2%
O 1000752
 
8.1%
f 667424
 
5.4%
n 667040
 
5.4%
F 666288
 
5.4%
a 665536
 
5.4%
l 665536
 
5.4%
% 333712
 
2.7%
0 333712
 
2.7%
Other values (10) 3333136
27.0%
ValueCountFrequency (%)
e 60231
16.3%
60165
16.2%
O 30099
 
8.1%
n 20110
 
5.4%
F 20011
 
5.4%
f 19978
 
5.4%
a 19912
 
5.4%
l 19912
 
5.4%
y 10055
 
2.7%
u 10055
 
2.7%
Other values (10) 100055
27.0%

Most occurring categories

ValueCountFrequency (%)
(unknown) 12334064
100.0%
ValueCountFrequency (%)
(unknown) 370583
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
2000560
16.2%
e 2000368
16.2%
O 1000752
 
8.1%
f 667424
 
5.4%
n 667040
 
5.4%
F 666288
 
5.4%
a 665536
 
5.4%
l 665536
 
5.4%
% 333712
 
2.7%
0 333712
 
2.7%
Other values (10) 3333136
27.0%
ValueCountFrequency (%)
e 60231
16.3%
60165
16.2%
O 30099
 
8.1%
n 20110
 
5.4%
F 20011
 
5.4%
f 19978
 
5.4%
a 19912
 
5.4%
l 19912
 
5.4%
y 10055
 
2.7%
u 10055
 
2.7%
Other values (10) 100055
27.0%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 12334064
100.0%
ValueCountFrequency (%)
(unknown) 370583
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
2000560
16.2%
e 2000368
16.2%
O 1000752
 
8.1%
f 667424
 
5.4%
n 667040
 
5.4%
F 666288
 
5.4%
a 665536
 
5.4%
l 665536
 
5.4%
% 333712
 
2.7%
0 333712
 
2.7%
Other values (10) 3333136
27.0%
ValueCountFrequency (%)
e 60231
16.3%
60165
16.2%
O 30099
 
8.1%
n 20110
 
5.4%
F 20011
 
5.4%
f 19978
 
5.4%
a 19912
 
5.4%
l 19912
 
5.4%
y 10055
 
2.7%
u 10055
 
2.7%
Other values (10) 100055
27.0%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 12334064
100.0%
ValueCountFrequency (%)
(unknown) 370583
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
2000560
16.2%
e 2000368
16.2%
O 1000752
 
8.1%
f 667424
 
5.4%
n 667040
 
5.4%
F 666288
 
5.4%
a 665536
 
5.4%
l 665536
 
5.4%
% 333712
 
2.7%
0 333712
 
2.7%
Other values (10) 3333136
27.0%
ValueCountFrequency (%)
e 60231
16.3%
60165
16.2%
O 30099
 
8.1%
n 20110
 
5.4%
F 20011
 
5.4%
f 19978
 
5.4%
a 19912
 
5.4%
l 19912
 
5.4%
y 10055
 
2.7%
u 10055
 
2.7%
Other values (10) 100055
27.0%

promotion_start_date
['Text', 'Text']

                Full Dataset  Stratified Sample
Distinct        984258        29981
Distinct (%)    98.4%         99.9%
Missing         0             0
Missing (%)     0.0%          0.0%
Memory size     7.6 MiB       234.5 KiB

Length

                     Full Dataset  Stratified Sample
Max length           19            19
Median length        19            19
Mean length          19            19
Min length           19            19

Characters and Unicode

                     Full Dataset  Stratified Sample
Total characters     19000000      570000
Distinct characters  13            13
Distinct categories  1             1
Distinct scripts     1             1
Distinct blocks      1             1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

             Full Dataset  Stratified Sample
Unique       968681        29962
Unique (%)   96.9%         99.9%

Sample

          Full Dataset         Stratified Sample
1st row   2021-07-14 14:28:42  2021-10-25 19:14:52
2nd row   2021-09-23 04:26:09  2021-04-19 09:43:32
3rd row   2021-06-13 12:31:15  2021-09-11 21:37:21
4th row   2021-05-23 05:42:48  2021-10-20 06:59:40
5th row   2021-04-19 04:55:32  2021-11-29 22:29:55
ValueCountFrequency (%)
2021-03-05 2885
 
0.1%
2021-02-07 2874
 
0.1%
2021-06-23 2871
 
0.1%
2021-05-15 2867
 
0.1%
2021-08-27 2863
 
0.1%
2021-11-04 2862
 
0.1%
2021-03-25 2858
 
0.1%
2021-09-06 2854
 
0.1%
2021-12-21 2851
 
0.1%
2021-08-06 2850
 
0.1%
Other values (86754) 1971365
98.6%
ValueCountFrequency (%)
2021-09-19 116
 
0.2%
2021-12-10 111
 
0.2%
2021-12-01 107
 
0.2%
2021-04-13 106
 
0.2%
2021-05-20 106
 
0.2%
2021-08-27 104
 
0.2%
2021-06-26 103
 
0.2%
2021-10-22 102
 
0.2%
2021-06-27 101
 
0.2%
2021-08-15 101
 
0.2%
Other values (25739) 58943
98.2%

Most occurring characters

ValueCountFrequency (%)
2 3411867
18.0%
0 3300257
17.4%
1 2938965
15.5%
- 2000000
10.5%
: 2000000
10.5%
1000000
 
5.3%
3 890896
 
4.7%
5 800172
 
4.2%
4 796605
 
4.2%
7 468643
 
2.5%
Other values (3) 1392595
7.3%
ValueCountFrequency (%)
2 102720
18.0%
0 99050
17.4%
1 87808
15.4%
- 60000
10.5%
: 60000
10.5%
30000
 
5.3%
3 27059
 
4.7%
5 23833
 
4.2%
4 23806
 
4.2%
8 14029
 
2.5%
Other values (3) 41695
7.3%

Most occurring categories

ValueCountFrequency (%)
(unknown) 19000000
100.0%
ValueCountFrequency (%)
(unknown) 570000
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
2 3411867
18.0%
0 3300257
17.4%
1 2938965
15.5%
- 2000000
10.5%
: 2000000
10.5%
1000000
 
5.3%
3 890896
 
4.7%
5 800172
 
4.2%
4 796605
 
4.2%
7 468643
 
2.5%
Other values (3) 1392595
7.3%
ValueCountFrequency (%)
2 102720
18.0%
0 99050
17.4%
1 87808
15.4%
- 60000
10.5%
: 60000
10.5%
30000
 
5.3%
3 27059
 
4.7%
5 23833
 
4.2%
4 23806
 
4.2%
8 14029
 
2.5%
Other values (3) 41695
7.3%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 19000000
100.0%
ValueCountFrequency (%)
(unknown) 570000
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
2 3411867
18.0%
0 3300257
17.4%
1 2938965
15.5%
- 2000000
10.5%
: 2000000
10.5%
1000000
 
5.3%
3 890896
 
4.7%
5 800172
 
4.2%
4 796605
 
4.2%
7 468643
 
2.5%
Other values (3) 1392595
7.3%
ValueCountFrequency (%)
2 102720
18.0%
0 99050
17.4%
1 87808
15.4%
- 60000
10.5%
: 60000
10.5%
30000
 
5.3%
3 27059
 
4.7%
5 23833
 
4.2%
4 23806
 
4.2%
8 14029
 
2.5%
Other values (3) 41695
7.3%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 19000000
100.0%
ValueCountFrequency (%)
(unknown) 570000
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
2 3411867
18.0%
0 3300257
17.4%
1 2938965
15.5%
- 2000000
10.5%
: 2000000
10.5%
1000000
 
5.3%
3 890896
 
4.7%
5 800172
 
4.2%
4 796605
 
4.2%
7 468643
 
2.5%
Other values (3) 1392595
7.3%
ValueCountFrequency (%)
2 102720
18.0%
0 99050
17.4%
1 87808
15.4%
- 60000
10.5%
: 60000
10.5%
30000
 
5.3%
3 27059
 
4.7%
5 23833
 
4.2%
4 23806
 
4.2%
8 14029
 
2.5%
Other values (3) 41695
7.3%

promotion_end_date
['Text', 'Text']

                Full Dataset  Stratified Sample
Distinct        984252        29988
Distinct (%)    98.4%         > 99.9%
Missing         0             0
Missing (%)     0.0%          0.0%
Memory size     7.6 MiB       234.5 KiB

Length

                     Full Dataset  Stratified Sample
Max length           19            19
Median length        19            19
Mean length          19            19
Min length           19            19

Characters and Unicode

                     Full Dataset  Stratified Sample
Total characters     19000000      570000
Distinct characters  13            13
Distinct categories  1             1
Distinct scripts     1             1
Distinct blocks      1             1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

             Full Dataset  Stratified Sample
Unique       968676        29976
Unique (%)   96.9%         99.9%

Sample

          Full Dataset         Stratified Sample
1st row   2022-12-30 13:04:13  2022-09-18 10:14:41
2nd row   2022-09-13 03:16:26  2022-02-20 19:03:50
3rd row   2022-03-13 00:53:35  2022-06-30 21:32:34
4th row   2022-02-06 00:42:30  2022-05-11 20:34:34
5th row   2022-12-04 13:07:09  2022-02-13 01:03:21
ValueCountFrequency (%)
2022-08-06 2905
 
0.1%
2022-03-08 2896
 
0.1%
2022-09-28 2874
 
0.1%
2022-02-22 2873
 
0.1%
2022-09-16 2872
 
0.1%
2022-06-10 2865
 
0.1%
2022-07-13 2858
 
0.1%
2022-12-15 2854
 
0.1%
2022-12-31 2847
 
0.1%
2022-08-25 2842
 
0.1%
Other values (86755) 1971314
98.6%
ValueCountFrequency (%)
2022-01-15 111
 
0.2%
2022-09-29 109
 
0.2%
2022-07-18 109
 
0.2%
2022-07-15 104
 
0.2%
2022-05-04 103
 
0.2%
2022-09-18 102
 
0.2%
2022-10-04 102
 
0.2%
2022-03-30 100
 
0.2%
2022-02-04 100
 
0.2%
2022-03-17 99
 
0.2%
Other values (25838) 58961
98.3%

Most occurring characters

ValueCountFrequency (%)
2 4411397
23.2%
0 3299242
17.4%
- 2000000
10.5%
: 2000000
10.5%
1 1940750
10.2%
1000000
 
5.3%
3 890892
 
4.7%
5 800852
 
4.2%
4 797493
 
4.2%
8 467852
 
2.5%
Other values (3) 1391522
 
7.3%
ValueCountFrequency (%)
2 132124
23.2%
0 99096
17.4%
- 60000
10.5%
: 60000
10.5%
1 58434
10.3%
30000
 
5.3%
3 26803
 
4.7%
5 24161
 
4.2%
4 23859
 
4.2%
8 14080
 
2.5%
Other values (3) 41443
 
7.3%

Most occurring categories

ValueCountFrequency (%)
(unknown) 19000000
100.0%
ValueCountFrequency (%)
(unknown) 570000
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
2 4411397
23.2%
0 3299242
17.4%
- 2000000
10.5%
: 2000000
10.5%
1 1940750
10.2%
1000000
 
5.3%
3 890892
 
4.7%
5 800852
 
4.2%
4 797493
 
4.2%
8 467852
 
2.5%
Other values (3) 1391522
 
7.3%
ValueCountFrequency (%)
2 132124
23.2%
0 99096
17.4%
- 60000
10.5%
: 60000
10.5%
1 58434
10.3%
30000
 
5.3%
3 26803
 
4.7%
5 24161
 
4.2%
4 23859
 
4.2%
8 14080
 
2.5%
Other values (3) 41443
 
7.3%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 19000000
100.0%
ValueCountFrequency (%)
(unknown) 570000
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
2 4411397
23.2%
0 3299242
17.4%
- 2000000
10.5%
: 2000000
10.5%
1 1940750
10.2%
1000000
 
5.3%
3 890892
 
4.7%
5 800852
 
4.2%
4 797493
 
4.2%
8 467852
 
2.5%
Other values (3) 1391522
 
7.3%
ValueCountFrequency (%)
2 132124
23.2%
0 99096
17.4%
- 60000
10.5%
: 60000
10.5%
1 58434
10.3%
30000
 
5.3%
3 26803
 
4.7%
5 24161
 
4.2%
4 23859
 
4.2%
8 14080
 
2.5%
Other values (3) 41443
 
7.3%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 19000000
100.0%
ValueCountFrequency (%)
(unknown) 570000
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
2 4411397
23.2%
0 3299242
17.4%
- 2000000
10.5%
: 2000000
10.5%
1 1940750
10.2%
1000000
 
5.3%
3 890892
 
4.7%
5 800852
 
4.2%
4 797493
 
4.2%
8 467852
 
2.5%
Other values (3) 1391522
 
7.3%
ValueCountFrequency (%)
2 132124
23.2%
0 99096
17.4%
- 60000
10.5%
: 60000
10.5%
1 58434
10.3%
30000
 
5.3%
3 26803
 
4.7%
5 24161
 
4.2%
4 23859
 
4.2%
8 14080
 
2.5%
Other values (3) 41443
 
7.3%

promotion_effectiveness
['Text', 'Text']

                Full Dataset  Stratified Sample
Distinct        3             3
Distinct (%)    < 0.1%        < 0.1%
Missing         0             0
Missing (%)     0.0%          0.0%
Memory size     7.6 MiB       234.5 KiB

Length

                     Full Dataset  Stratified Sample
Max length           6             6
Median length        4             4
Mean length          4.333407      4.338333333
Min length           3             3

Characters and Unicode

                     Full Dataset  Stratified Sample
Total characters     4333407       130150
Distinct characters  12            12
Distinct categories  1             1
Distinct scripts     1             1
Distinct blocks      1             1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

             Full Dataset  Stratified Sample
Unique       0             0
Unique (%)   0.0%          0.0%

Sample

          Full Dataset  Stratified Sample
1st row   High          High
2nd row   Low           High
3rd row   Low           High
4th row   High          High
5th row   Medium        Medium
Full Dataset

Value   Count   Frequency (%)
high    333660  33.4%
medium  333249  33.3%
low     333091  33.3%

Stratified Sample

Value   Count   Frequency (%)
high    10171   33.9%
medium  9993    33.3%
low     9836    32.8%

Most occurring characters

ValueCountFrequency (%)
i 666909
15.4%
H 333660
7.7%
g 333660
7.7%
h 333660
7.7%
M 333249
7.7%
e 333249
7.7%
d 333249
7.7%
u 333249
7.7%
m 333249
7.7%
L 333091
7.7%
Other values (2) 666182
15.4%
ValueCountFrequency (%)
i 20164
15.5%
H 10171
7.8%
g 10171
7.8%
h 10171
7.8%
M 9993
7.7%
e 9993
7.7%
d 9993
7.7%
u 9993
7.7%
m 9993
7.7%
L 9836
7.6%
Other values (2) 19672
15.1%

Most occurring categories

ValueCountFrequency (%)
(unknown) 4333407
100.0%
ValueCountFrequency (%)
(unknown) 130150
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
i 666909
15.4%
H 333660
7.7%
g 333660
7.7%
h 333660
7.7%
M 333249
7.7%
e 333249
7.7%
d 333249
7.7%
u 333249
7.7%
m 333249
7.7%
L 333091
7.7%
Other values (2) 666182
15.4%
ValueCountFrequency (%)
i 20164
15.5%
H 10171
7.8%
g 10171
7.8%
h 10171
7.8%
M 9993
7.7%
e 9993
7.7%
d 9993
7.7%
u 9993
7.7%
m 9993
7.7%
L 9836
7.6%
Other values (2) 19672
15.1%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 4333407
100.0%
ValueCountFrequency (%)
(unknown) 130150
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
i 666909
15.4%
H 333660
7.7%
g 333660
7.7%
h 333660
7.7%
M 333249
7.7%
e 333249
7.7%
d 333249
7.7%
u 333249
7.7%
m 333249
7.7%
L 333091
7.7%
Other values (2) 666182
15.4%
ValueCountFrequency (%)
i 20164
15.5%
H 10171
7.8%
g 10171
7.8%
h 10171
7.8%
M 9993
7.7%
e 9993
7.7%
d 9993
7.7%
u 9993
7.7%
m 9993
7.7%
L 9836
7.6%
Other values (2) 19672
15.1%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 4333407
100.0%
ValueCountFrequency (%)
(unknown) 130150
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
i 666909
15.4%
H 333660
7.7%
g 333660
7.7%
h 333660
7.7%
M 333249
7.7%
e 333249
7.7%
d 333249
7.7%
u 333249
7.7%
m 333249
7.7%
L 333091
7.7%
Other values (2) 666182
15.4%
ValueCountFrequency (%)
i 20164
15.5%
H 10171
7.8%
g 10171
7.8%
h 10171
7.8%
M 9993
7.7%
e 9993
7.7%
d 9993
7.7%
u 9993
7.7%
m 9993
7.7%
L 9836
7.6%
Other values (2) 19672
15.1%

promotion_channel
['Text', 'Text']

                Full Dataset  Stratified Sample
Distinct        3             3
Distinct (%)    < 0.1%        < 0.1%
Missing         0             0
Missing (%)     0.0%          0.0%
Memory size     7.6 MiB       234.5 KiB

Length

                     Full Dataset  Stratified Sample
Max length           12            12
Median length        8             8
Mean length          8.665428      8.667266667
Min length           6             6

Characters and Unicode

                     Full Dataset  Stratified Sample
Total characters     8665428       260018
Distinct characters  17            17
Distinct categories  1             1
Distinct scripts     1             1
Distinct blocks      1             1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

             Full Dataset  Stratified Sample
Unique       0             0
Unique (%)   0.0%          0.0%

Sample

          Full Dataset  Stratified Sample
1st row   Online        Social Media
2nd row   Social Media  Online
3rd row   Online        Social Media
4th row   Social Media  Social Media
5th row   Online        In-store
Full Dataset

Value     Count   Frequency (%)
online    333694  25.0%
social    333204  25.0%
media     333204  25.0%
in-store  333102  25.0%

Stratified Sample

Value     Count   Frequency (%)
in-store  10060   25.2%
social    9983    25.0%
media     9983    25.0%
online    9957    24.9%

Most occurring characters

ValueCountFrequency (%)
n 1000490
11.5%
i 1000102
11.5%
e 1000000
11.5%
l 666898
 
7.7%
a 666408
 
7.7%
o 666306
 
7.7%
O 333694
 
3.9%
S 333204
 
3.8%
c 333204
 
3.8%
333204
 
3.8%
Other values (7) 2331918
26.9%
ValueCountFrequency (%)
e 30000
11.5%
n 29974
11.5%
i 29923
11.5%
o 20043
 
7.7%
a 19966
 
7.7%
l 19940
 
7.7%
I 10060
 
3.9%
t 10060
 
3.9%
- 10060
 
3.9%
s 10060
 
3.9%
Other values (7) 69932
26.9%

Most occurring categories

ValueCountFrequency (%)
(unknown) 8665428
100.0%
ValueCountFrequency (%)
(unknown) 260018
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
n 1000490
11.5%
i 1000102
11.5%
e 1000000
11.5%
l 666898
 
7.7%
a 666408
 
7.7%
o 666306
 
7.7%
O 333694
 
3.9%
S 333204
 
3.8%
c 333204
 
3.8%
333204
 
3.8%
Other values (7) 2331918
26.9%
ValueCountFrequency (%)
e 30000
11.5%
n 29974
11.5%
i 29923
11.5%
o 20043
 
7.7%
a 19966
 
7.7%
l 19940
 
7.7%
I 10060
 
3.9%
t 10060
 
3.9%
- 10060
 
3.9%
s 10060
 
3.9%
Other values (7) 69932
26.9%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 8665428
100.0%
ValueCountFrequency (%)
(unknown) 260018
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
n 1000490
11.5%
i 1000102
11.5%
e 1000000
11.5%
l 666898
 
7.7%
a 666408
 
7.7%
o 666306
 
7.7%
O 333694
 
3.9%
S 333204
 
3.8%
c 333204
 
3.8%
333204
 
3.8%
Other values (7) 2331918
26.9%
ValueCountFrequency (%)
e 30000
11.5%
n 29974
11.5%
i 29923
11.5%
o 20043
 
7.7%
a 19966
 
7.7%
l 19940
 
7.7%
I 10060
 
3.9%
t 10060
 
3.9%
- 10060
 
3.9%
s 10060
 
3.9%
Other values (7) 69932
26.9%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 8665428
100.0%
ValueCountFrequency (%)
(unknown) 260018
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
n 1000490
11.5%
i 1000102
11.5%
e 1000000
11.5%
l 666898
 
7.7%
a 666408
 
7.7%
o 666306
 
7.7%
O 333694
 
3.9%
S 333204
 
3.8%
c 333204
 
3.8%
333204
 
3.8%
Other values (7) 2331918
26.9%
ValueCountFrequency (%)
e 30000
11.5%
n 29974
11.5%
i 29923
11.5%
o 20043
 
7.7%
a 19966
 
7.7%
l 19940
 
7.7%
I 10060
 
3.9%
t 10060
 
3.9%
- 10060
 
3.9%
s 10060
 
3.9%
Other values (7) 69932
26.9%
                Full Dataset  Stratified Sample
Distinct        2             2
Distinct (%)    < 0.1%        < 0.1%
Missing         0             0
Missing (%)     0.0%          0.0%
Memory size     7.6 MiB       234.5 KiB

Length

                     Full Dataset  Stratified Sample
Max length           19            19
Median length        13            19
Mean length          15.999292     16.0068
Min length           13            13

Characters and Unicode

                     Full Dataset  Stratified Sample
Total characters     15999292      480204
Distinct characters  15            15
Distinct categories  1             1
Distinct scripts     1             1
Distinct blocks      1             1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

             Full Dataset  Stratified Sample
Unique       0             0
Unique (%)   0.0%          0.0%

Sample

          Full Dataset         Stratified Sample
1st row   New Customers        Returning Customers
2nd row   New Customers        Returning Customers
3rd row   New Customers        Returning Customers
4th row   Returning Customers  New Customers
5th row   New Customers        Returning Customers
Full Dataset

Value      Count    Frequency (%)
customers  1000000  50.0%
new        500118   25.0%
returning  499882   25.0%

Stratified Sample

Value      Count    Frequency (%)
customers  30000    50.0%
returning  15034    25.1%
new        14966    24.9%

Most occurring characters

ValueCountFrequency (%)
e 2000000
12.5%
s 2000000
12.5%
u 1499882
9.4%
r 1499882
9.4%
t 1499882
9.4%
C 1000000
6.3%
1000000
6.3%
o 1000000
6.3%
m 1000000
6.3%
n 999764
 
6.2%
Other values (5) 2499882
15.6%
ValueCountFrequency (%)
e 60000
12.5%
s 60000
12.5%
t 45034
9.4%
r 45034
9.4%
u 45034
9.4%
n 30068
6.3%
30000
 
6.2%
m 30000
 
6.2%
o 30000
 
6.2%
C 30000
 
6.2%
Other values (5) 75034
15.6%

Most occurring categories, scripts and blocks

All characters fall into a single category, script and block, each reported as (unknown): 15999292 characters (100.0%) in the Full Dataset and 480204 (100.0%) in the Stratified Sample. The most frequent characters per category, per script and per block are therefore identical; they are listed once below.

Full Dataset
Value | Count | Frequency (%)
e | 2000000 | 12.5%
s | 2000000 | 12.5%
u | 1499882 | 9.4%
r | 1499882 | 9.4%
t | 1499882 | 9.4%
C | 1000000 | 6.3%
(space) | 1000000 | 6.3%
o | 1000000 | 6.3%
m | 1000000 | 6.3%
n | 999764 | 6.2%
Other values (5) | 2499882 | 15.6%

Stratified Sample
Value | Count | Frequency (%)
e | 60000 | 12.5%
s | 60000 | 12.5%
t | 45034 | 9.4%
r | 45034 | 9.4%
u | 45034 | 9.4%
n | 30068 | 6.3%
(space) | 30000 | 6.2%
m | 30000 | 6.2%
o | 30000 | 6.2%
C | 30000 | 6.2%
Other values (5) | 75034 | 15.6%

customer_zip_code
Real number (ℝ)

Statistic | Full Dataset | Stratified Sample
Distinct | 89999 | 25539
Distinct (%) | 9.0% | 85.1%
Missing | 0 | 0
Missing (%) | 0.0% | 0.0%
Infinite | 0 | 0
Infinite (%) | 0.0% | 0.0%
Mean | 54993.64477 | 54854.25793
Minimum | 10000 | 10003
Maximum | 99998 | 99985
Zeros | 0 | 0
Zeros (%) | 0.0% | 0.0%
Negative | 0 | 0
Negative (%) | 0.0% | 0.0%
Memory size | 7.6 MiB | 234.5 KiB

Quantile statistics

Statistic | Full Dataset | Stratified Sample
Minimum | 10000 | 10003
5-th percentile | 14491 | 14581.95
Q1 | 32477.75 | 32239
median | 54966 | 54559.5
Q3 | 77493 | 77304.75
95-th percentile | 95497 | 95450.1
Maximum | 99998 | 99985
Range | 89998 | 89982
Interquartile range (IQR) | 45015.25 | 45065.75

Descriptive statistics

Statistic | Full Dataset | Stratified Sample
Standard deviation | 25975.8078 | 25945.28587
Coefficient of variation (CV) | 0.4723419934 | 0.4729858145
Kurtosis | -1.199859176 | -1.203477455
Mean | 54993.64477 | 54854.25793
Median Absolute Deviation (MAD) | 22509 | 22515.5
Skewness | 0.00079246458 | 0.01518527042
Sum | 5.499364477 × 10^10 | 1645627738
Variance | 674742590.8 | 673157858.7
Monotonicity | Not monotonic | Not monotonic
Histogram with fixed size bins (bins=50)
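
Fixed-size binning splits the observed range into equal-width intervals. The sketch below is a minimal pure-Python illustration, assuming 50 bins over the Full Dataset range of customer_zip_code (10000 to 99998). `bin_index` is a hypothetical helper written for this example; it is not taken from the profiling tool.

```python
def bin_index(value, lo, hi, bins):
    """Return the 0-based index of the equal-width bin that contains value."""
    if not lo <= value <= hi:
        raise ValueError("value outside [lo, hi]")
    width = (hi - lo) / bins
    # Clamp so that the maximum falls into the last bin (closed on the right).
    return min(int((value - lo) / width), bins - 1)


# Assumed bounds: Minimum and Maximum of customer_zip_code in the Full Dataset.
LO, HI, BINS = 10000, 99998, 50

first_bin = bin_index(10000, LO, HI, BINS)    # the minimum lands in bin 0
median_bin = bin_index(54966, LO, HI, BINS)   # the reported median lands in bin 24
last_bin = bin_index(99998, LO, HI, BINS)     # the maximum lands in bin 49
print(first_bin, median_bin, last_bin)
```

Each bin spans (99998 - 10000) / 50 = 1799.96 ZIP-code units, so a near-uniform variable such as this one should show bars of roughly equal height.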
Full Dataset
Value | Count | Frequency (%)
41138 | 27 | < 0.1%
28225 | 27 | < 0.1%
19719 | 27 | < 0.1%
25427 | 27 | < 0.1%
95120 | 26 | < 0.1%
38515 | 26 | < 0.1%
54735 | 26 | < 0.1%
21109 | 26 | < 0.1%
17611 | 25 | < 0.1%
82394 | 25 | < 0.1%
Other values (89989) | 999738 | > 99.9%

Stratified Sample
Value | Count | Frequency (%)
98876 | 5 | < 0.1%
41905 | 5 | < 0.1%
47419 | 5 | < 0.1%
31912 | 5 | < 0.1%
35444 | 5 | < 0.1%
99244 | 4 | < 0.1%
36854 | 4 | < 0.1%
72876 | 4 | < 0.1%
20323 | 4 | < 0.1%
13575 | 4 | < 0.1%
Other values (25529) | 29955 | 99.9%

Smallest values (Full Dataset)
Value | Count | Frequency (%)
10000 | 12 | < 0.1%
10001 | 14 | < 0.1%
10002 | 6 | < 0.1%
10003 | 12 | < 0.1%
10004 | 5 | < 0.1%

Smallest values (Stratified Sample)
Value | Count | Frequency (%)
10003 | 2 | < 0.1%
10007 | 1 | < 0.1%
10009 | 1 | < 0.1%
10011 | 1 | < 0.1%
10012 | 2 | < 0.1%

customer_city
Text

Statistic | Full Dataset | Stratified Sample
Distinct | 4 | 4
Distinct (%) | < 0.1% | < 0.1%
Missing | 0 | 0
Missing (%) | 0.0% | 0.0%
Memory size | 7.6 MiB | 234.5 KiB

Length

Statistic | Full Dataset | Stratified Sample
Max length | 6 | 6
Median length | 6 | 6
Mean length | 6 | 6
Min length | 6 | 6

Characters and Unicode

Statistic | Full Dataset | Stratified Sample
Total characters | 6000000 | 180000
Distinct characters | 8 | 8
Distinct categories | 1 | 1
Distinct scripts | 1 | 1
Distinct blocks | 1 | 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
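
As a rough illustration of what such character properties look like, the sketch below counts characters by Unicode general category using Python's standard `unicodedata` module. It is an assumption-laden example on a toy list shaped like the customer_city sample rows, not the profiling tool's own implementation (the report itself lists a single "(unknown)" category).

```python
import unicodedata
from collections import Counter


def category_counts(values):
    """Count characters by Unicode general category (e.g. 'Lu', 'Ll', 'Zs')."""
    counts = Counter()
    for text in values:
        for ch in text:
            counts[unicodedata.category(ch)] += 1
    return counts


# Toy input mirroring the first sample rows of customer_city.
rows = ["City D", "City A", "City B"]
counts = category_counts(rows)

# Each row has 2 uppercase letters (Lu), 3 lowercase letters (Ll), 1 space (Zs).
print(dict(counts))
```

Here `Lu`, `Ll` and `Zs` are the standard general-category codes for uppercase letters, lowercase letters and space separators.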

Unique

Statistic | Full Dataset | Stratified Sample
Unique | 0 | 0
Unique (%) | 0.0% | 0.0%

Sample

Row | Full Dataset | Stratified Sample
1st row | City D | City A
2nd row | City A | City A
3rd row | City B | City D
4th row | City A | City D
5th row | City B | City D

Full Dataset
Value | Count | Frequency (%)
city | 1000000 | 50.0%
b | 250788 | 12.5%
c | 249955 | 12.5%
a | 249698 | 12.5%
d | 249559 | 12.5%

Stratified Sample
Value | Count | Frequency (%)
city | 30000 | 50.0%
c | 7608 | 12.7%
a | 7531 | 12.6%
b | 7517 | 12.5%
d | 7344 | 12.2%

Most occurring characters

Full Dataset
Value | Count | Frequency (%)
C | 1249955 | 20.8%
i | 1000000 | 16.7%
t | 1000000 | 16.7%
y | 1000000 | 16.7%
(space) | 1000000 | 16.7%
B | 250788 | 4.2%
A | 249698 | 4.2%
D | 249559 | 4.2%

Stratified Sample
Value | Count | Frequency (%)
C | 37608 | 20.9%
i | 30000 | 16.7%
t | 30000 | 16.7%
y | 30000 | 16.7%
(space) | 30000 | 16.7%
A | 7531 | 4.2%
B | 7517 | 4.2%
D | 7344 | 4.1%

Most occurring categories, scripts and blocks

All characters fall into a single category, script and block, each reported as (unknown): 6000000 characters (100.0%) in the Full Dataset and 180000 (100.0%) in the Stratified Sample. The most frequent characters per category, per script and per block are therefore identical to the "Most occurring characters" tables above.

customer_state
Text

Statistic | Full Dataset | Stratified Sample
Distinct | 3 | 3
Distinct (%) | < 0.1% | < 0.1%
Missing | 0 | 0
Missing (%) | 0.0% | 0.0%
Memory size | 7.6 MiB | 234.5 KiB

Length

Statistic | Full Dataset | Stratified Sample
Max length | 7 | 7
Median length | 7 | 7
Mean length | 7 | 7
Min length | 7 | 7

Characters and Unicode

Statistic | Full Dataset | Stratified Sample
Total characters | 7000000 | 210000
Distinct characters | 8 | 8
Distinct categories | 1 | 1
Distinct scripts | 1 | 1
Distinct blocks | 1 | 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Statistic | Full Dataset | Stratified Sample
Unique | 0 | 0
Unique (%) | 0.0% | 0.0%

Sample

Row | Full Dataset | Stratified Sample
1st row | State Y | State Y
2nd row | State X | State Z
3rd row | State X | State Z
4th row | State Y | State Y
5th row | State Z | State Z

Full Dataset
Value | Count | Frequency (%)
state | 1000000 | 50.0%
z | 333674 | 16.7%
x | 333196 | 16.7%
y | 333130 | 16.7%

Stratified Sample
Value | Count | Frequency (%)
state | 30000 | 50.0%
x | 10125 | 16.9%
z | 10078 | 16.8%
y | 9797 | 16.3%

Most occurring characters

Full Dataset
Value | Count | Frequency (%)
t | 2000000 | 28.6%
S | 1000000 | 14.3%
a | 1000000 | 14.3%
e | 1000000 | 14.3%
(space) | 1000000 | 14.3%
Z | 333674 | 4.8%
X | 333196 | 4.8%
Y | 333130 | 4.8%

Stratified Sample
Value | Count | Frequency (%)
t | 60000 | 28.6%
S | 30000 | 14.3%
a | 30000 | 14.3%
e | 30000 | 14.3%
(space) | 30000 | 14.3%
X | 10125 | 4.8%
Z | 10078 | 4.8%
Y | 9797 | 4.7%

Most occurring categories, scripts and blocks

All characters fall into a single category, script and block, each reported as (unknown): 7000000 characters (100.0%) in the Full Dataset and 210000 (100.0%) in the Stratified Sample. The most frequent characters per category, per script and per block are therefore identical to the "Most occurring characters" tables above.

store_zip_code
Real number (ℝ)

Statistic | Full Dataset | Stratified Sample
Distinct | 89999 | 25570
Distinct (%) | 9.0% | 85.2%
Missing | 0 | 0
Missing (%) | 0.0% | 0.0%
Infinite | 0 | 0
Infinite (%) | 0.0% | 0.0%
Mean | 54972.76671 | 54951.9952
Minimum | 10000 | 10000
Maximum | 99998 | 99994
Zeros | 0 | 0
Zeros (%) | 0.0% | 0.0%
Negative | 0 | 0
Negative (%) | 0.0% | 0.0%
Memory size | 7.6 MiB | 234.5 KiB

Quantile statistics

Statistic | Full Dataset | Stratified Sample
Minimum | 10000 | 10000
5-th percentile | 14488 | 14347.8
Q1 | 32473 | 32330.25
median | 54961 | 55001
Q3 | 77451 | 77483.25
95-th percentile | 95470 | 95507.15
Maximum | 99998 | 99994
Range | 89998 | 89994
Interquartile range (IQR) | 44978 | 45153

Descriptive statistics

Statistic | Full Dataset | Stratified Sample
Standard deviation | 25981.48314 | 26055.24544
Coefficient of variation (CV) | 0.4726246229 | 0.4741455765
Kurtosis | -1.200166165 | -1.2041705
Mean | 54972.76671 | 54951.9952
Median Absolute Deviation (MAD) | 22489.5 | 22594
Skewness | -0.0001039626203 | -0.001870781196
Sum | 5.497276671 × 10^10 | 1648559856
Variance | 675037466.1 | 678875815
Monotonicity | Not monotonic | Not monotonic
Histogram with fixed size bins (bins=50)
Full Dataset
Value | Count | Frequency (%)
20956 | 28 | < 0.1%
29159 | 27 | < 0.1%
59836 | 26 | < 0.1%
54696 | 26 | < 0.1%
92910 | 26 | < 0.1%
43369 | 26 | < 0.1%
26386 | 26 | < 0.1%
90024 | 26 | < 0.1%
27134 | 26 | < 0.1%
20477 | 26 | < 0.1%
Other values (89989) | 999737 | > 99.9%

Stratified Sample
Value | Count | Frequency (%)
72397 | 5 | < 0.1%
65367 | 5 | < 0.1%
74371 | 5 | < 0.1%
18725 | 4 | < 0.1%
96681 | 4 | < 0.1%
45189 | 4 | < 0.1%
13087 | 4 | < 0.1%
62639 | 4 | < 0.1%
47243 | 4 | < 0.1%
77335 | 4 | < 0.1%
Other values (25560) | 29957 | 99.9%

Smallest values (Full Dataset)
Value | Count | Frequency (%)
10000 | 11 | < 0.1%
10001 | 6 | < 0.1%
10002 | 15 | < 0.1%
10003 | 8 | < 0.1%
10004 | 14 | < 0.1%

Smallest values (Stratified Sample)
Value | Count | Frequency (%)
10000 | 1 | < 0.1%
10005 | 2 | < 0.1%
10006 | 1 | < 0.1%
10007 | 1 | < 0.1%
10008 | 1 | < 0.1%

store_city
Text

Statistic | Full Dataset | Stratified Sample
Distinct | 4 | 4
Distinct (%) | < 0.1% | < 0.1%
Missing | 0 | 0
Missing (%) | 0.0% | 0.0%
Memory size | 7.6 MiB | 234.5 KiB

Length

Statistic | Full Dataset | Stratified Sample
Max length | 6 | 6
Median length | 6 | 6
Mean length | 6 | 6
Min length | 6 | 6

Characters and Unicode

Statistic | Full Dataset | Stratified Sample
Total characters | 6000000 | 180000
Distinct characters | 8 | 8
Distinct categories | 1 | 1
Distinct scripts | 1 | 1
Distinct blocks | 1 | 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Statistic | Full Dataset | Stratified Sample
Unique | 0 | 0
Unique (%) | 0.0% | 0.0%

Sample

Row | Full Dataset | Stratified Sample
1st row | City D | City A
2nd row | City C | City C
3rd row | City A | City C
4th row | City B | City C
5th row | City C | City A

Full Dataset
Value | Count | Frequency (%)
city | 1000000 | 50.0%
d | 250315 | 12.5%
c | 250177 | 12.5%
b | 249965 | 12.5%
a | 249543 | 12.5%

Stratified Sample
Value | Count | Frequency (%)
city | 30000 | 50.0%
a | 7602 | 12.7%
d | 7567 | 12.6%
c | 7500 | 12.5%
b | 7331 | 12.2%

Most occurring characters

Full Dataset
Value | Count | Frequency (%)
C | 1250177 | 20.8%
i | 1000000 | 16.7%
t | 1000000 | 16.7%
y | 1000000 | 16.7%
(space) | 1000000 | 16.7%
D | 250315 | 4.2%
B | 249965 | 4.2%
A | 249543 | 4.2%

Stratified Sample
Value | Count | Frequency (%)
C | 37500 | 20.8%
i | 30000 | 16.7%
t | 30000 | 16.7%
y | 30000 | 16.7%
(space) | 30000 | 16.7%
A | 7602 | 4.2%
D | 7567 | 4.2%
B | 7331 | 4.1%

Most occurring categories, scripts and blocks

All characters fall into a single category, script and block, each reported as (unknown): 6000000 characters (100.0%) in the Full Dataset and 180000 (100.0%) in the Stratified Sample. The most frequent characters per category, per script and per block are therefore identical to the "Most occurring characters" tables above.

store_state
Text

Statistic | Full Dataset | Stratified Sample
Distinct | 3 | 3
Distinct (%) | < 0.1% | < 0.1%
Missing | 0 | 0
Missing (%) | 0.0% | 0.0%
Memory size | 7.6 MiB | 234.5 KiB

Length

Statistic | Full Dataset | Stratified Sample
Max length | 7 | 7
Median length | 7 | 7
Mean length | 7 | 7
Min length | 7 | 7

Characters and Unicode

Statistic | Full Dataset | Stratified Sample
Total characters | 7000000 | 210000
Distinct characters | 8 | 8
Distinct categories | 1 | 1
Distinct scripts | 1 | 1
Distinct blocks | 1 | 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Statistic | Full Dataset | Stratified Sample
Unique | 0 | 0
Unique (%) | 0.0% | 0.0%

Sample

Row | Full Dataset | Stratified Sample
1st row | State Y | State Y
2nd row | State X | State Y
3rd row | State Y | State X
4th row | State Z | State Z
5th row | State X | State Z

Full Dataset
Value | Count | Frequency (%)
state | 1000000 | 50.0%
x | 333702 | 16.7%
z | 333602 | 16.7%
y | 332696 | 16.6%

Stratified Sample
Value | Count | Frequency (%)
state | 30000 | 50.0%
z | 10041 | 16.7%
x | 10009 | 16.7%
y | 9950 | 16.6%

Most occurring characters

Full Dataset
Value | Count | Frequency (%)
t | 2000000 | 28.6%
S | 1000000 | 14.3%
a | 1000000 | 14.3%
e | 1000000 | 14.3%
(space) | 1000000 | 14.3%
X | 333702 | 4.8%
Z | 333602 | 4.8%
Y | 332696 | 4.8%

Stratified Sample
Value | Count | Frequency (%)
t | 60000 | 28.6%
S | 30000 | 14.3%
a | 30000 | 14.3%
e | 30000 | 14.3%
(space) | 30000 | 14.3%
Z | 10041 | 4.8%
X | 10009 | 4.8%
Y | 9950 | 4.7%

Most occurring categories, scripts and blocks

All characters fall into a single category, script and block, each reported as (unknown): 7000000 characters (100.0%) in the Full Dataset and 210000 (100.0%) in the Stratified Sample. The most frequent characters per category, per script and per block are therefore identical to the "Most occurring characters" tables above.

distance_to_store
Real number (ℝ)

Statistic | Full Dataset | Stratified Sample
Distinct | 10001 | 9505
Distinct (%) | 1.0% | 31.7%
Missing | 0 | 0
Missing (%) | 0.0% | 0.0%
Infinite | 0 | 0
Infinite (%) | 0.0% | 0.0%
Mean | 49.97910924 | 49.60062667
Minimum | 0 | 0.01
Maximum | 100 | 100
Zeros | 62 | 0
Zeros (%) | < 0.1% | 0.0%
Negative | 0 | 0
Negative (%) | 0.0% | 0.0%
Memory size | 7.6 MiB | 234.5 KiB

Quantile statistics

Statistic | Full Dataset | Stratified Sample
Minimum | 0 | 0.01
5-th percentile | 5.03 | 4.74
Q1 | 24.97 | 24.55
median | 49.96 | 49.55
Q3 | 74.95 | 74.54
95-th percentile | 94.98 | 94.8805
Maximum | 100 | 100
Range | 100 | 99.99
Interquartile range (IQR) | 49.98 | 49.99

Descriptive statistics

Statistic | Full Dataset | Stratified Sample
Standard deviation | 28.86098911 | 28.88565901
Coefficient of variation (CV) | 0.5774610543 | 0.5823647996
Kurtosis | -1.200199633 | -1.19802704
Mean | 49.97910924 | 49.60062667
Median Absolute Deviation (MAD) | 24.99 | 25
Skewness | 0.001218286468 | 0.01292473608
Sum | 49979109.24 | 1488018.8
Variance | 832.9566927 | 834.3812965
Monotonicity | Not monotonic | Not monotonic
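
Several of these figures are derived from others in the same tables. The short check below is a sketch using the Full Dataset values of distance_to_store; it assumes the usual definitions (CV = standard deviation / mean, IQR = Q3 - Q1, range = maximum - minimum, variance = standard deviation squared) and is only meant to show how the rows relate to each other.

```python
import math

# Full Dataset values for distance_to_store, copied from the report tables.
mean, std = 49.97910924, 28.86098911
q1, q3 = 24.97, 74.95
minimum, maximum = 0.0, 100.0

cv = std / mean                   # coefficient of variation
iqr = q3 - q1                     # interquartile range
value_range = maximum - minimum   # range
variance = std ** 2               # variance

print(f"CV={cv:.10f}  IQR={iqr:.2f}  range={value_range:g}  variance={variance:.7f}")

# Compare against the reported rows, allowing for rounding in the inputs.
assert math.isclose(cv, 0.5774610543, abs_tol=1e-7)
assert math.isclose(iqr, 49.98, abs_tol=1e-9)
assert math.isclose(variance, 832.9566927, abs_tol=1e-5)
```

The same relations can be used as a quick sanity check on any other numeric variable in the report.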
Histogram with fixed size bins (bins=50)
Full Dataset
Value | Count | Frequency (%)
99.05 | 139 | < 0.1%
0.01 | 138 | < 0.1%
9.68 | 138 | < 0.1%
30.79 | 136 | < 0.1%
31.27 | 135 | < 0.1%
22.61 | 134 | < 0.1%
40.84 | 134 | < 0.1%
78.41 | 134 | < 0.1%
82.37 | 133 | < 0.1%
89.9 | 133 | < 0.1%
Other values (9991) | 998646 | 99.9%

Stratified Sample
Value | Count | Frequency (%)
68.54 | 11 | < 0.1%
59.82 | 11 | < 0.1%
93.82 | 11 | < 0.1%
76.63 | 11 | < 0.1%
15.31 | 10 | < 0.1%
80.79 | 10 | < 0.1%
81.05 | 10 | < 0.1%
47.46 | 10 | < 0.1%
13.25 | 10 | < 0.1%
30.27 | 10 | < 0.1%
Other values (9495) | 29896 | 99.7%

Smallest values (Full Dataset)
Value | Count | Frequency (%)
0 | 62 | < 0.1%
0.01 | 138 | < 0.1%
0.02 | 88 | < 0.1%
0.03 | 113 | < 0.1%
0.04 | 86 | < 0.1%

Smallest values (Stratified Sample)
Value | Count | Frequency (%)
0.01 | 4 | < 0.1%
0.02 | 2 | < 0.1%
0.03 | 2 | < 0.1%
0.04 | 2 | < 0.1%
0.05 | 4 | < 0.1%

holiday_season
Text

Statistic | Full Dataset | Stratified Sample
Distinct | 2 | 2
Distinct (%) | < 0.1% | < 0.1%
Missing | 0 | 0
Missing (%) | 0.0% | 0.0%
Memory size | 7.6 MiB | 234.5 KiB

Length

Statistic | Full Dataset | Stratified Sample
Max length | 3 | 3
Median length | 3 | 2
Mean length | 2.500214 | 2.498866667
Min length | 2 | 2

Characters and Unicode

Statistic | Full Dataset | Stratified Sample
Total characters | 2500214 | 74966
Distinct characters | 5 | 5
Distinct categories | 1 | 1
Distinct scripts | 1 | 1
Distinct blocks | 1 | 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Statistic | Full Dataset | Stratified Sample
Unique | 0 | 0
Unique (%) | 0.0% | 0.0%

Sample

Row | Full Dataset | Stratified Sample
1st row | No | Yes
2nd row | No | No
3rd row | Yes | Yes
4th row | Yes | Yes
5th row | Yes | No

Full Dataset
Value | Count | Frequency (%)
yes | 500214 | 50.0%
no | 499786 | 50.0%

Stratified Sample
Value | Count | Frequency (%)
no | 15034 | 50.1%
yes | 14966 | 49.9%
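
The "Total characters" and "Mean length" figures for holiday_season follow directly from these value counts. The sketch below reproduces them for the Full Dataset, assuming each value is counted by its plain string length ("Yes" = 3, "No" = 2).

```python
# Full Dataset value counts for holiday_season, taken from the report.
counts = {"Yes": 500214, "No": 499786}

n_rows = sum(counts.values())
total_chars = sum(len(value) * count for value, count in counts.items())
mean_length = total_chars / n_rows

print(n_rows, total_chars, mean_length)

# These match the reported Total characters (2500214) and Mean length (2.500214).
assert n_rows == 1_000_000
assert total_chars == 2_500_214
```

Because "Yes" is slightly more frequent than "No" in the Full Dataset, the median length is 3 there, while it is 2 in the Stratified Sample where "No" is the majority value.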

Most occurring characters

Full Dataset
Value | Count | Frequency (%)
Y | 500214 | 20.0%
e | 500214 | 20.0%
s | 500214 | 20.0%
N | 499786 | 20.0%
o | 499786 | 20.0%

Stratified Sample
Value | Count | Frequency (%)
N | 15034 | 20.1%
o | 15034 | 20.1%
Y | 14966 | 20.0%
e | 14966 | 20.0%
s | 14966 | 20.0%

Most occurring categories, scripts and blocks

All characters fall into a single category, script and block, each reported as (unknown): 2500214 characters (100.0%) in the Full Dataset and 74966 (100.0%) in the Stratified Sample. The most frequent characters per category, per script and per block are therefore identical to the "Most occurring characters" tables above.

season
Text

Statistic | Full Dataset | Stratified Sample
Distinct | 4 | 4
Distinct (%) | < 0.1% | < 0.1%
Missing | 0 | 0
Missing (%) | 0.0% | 0.0%
Memory size | 7.6 MiB | 234.5 KiB

Length

Statistic | Full Dataset | Stratified Sample
Max length | 6 | 6
Median length | 6 | 6
Mean length | 5.500322 | 5.495533333
Min length | 4 | 4

Characters and Unicode

Statistic | Full Dataset | Stratified Sample
Total characters | 5500322 | 164866
Distinct characters | 14 | 14
Distinct categories | 1 | 1
Distinct scripts | 1 | 1
Distinct blocks | 1 | 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Statistic | Full Dataset | Stratified Sample
Unique | 0 | 0
Unique (%) | 0.0% | 0.0%

Sample

Row | Full Dataset | Stratified Sample
1st row | Spring | Spring
2nd row | Summer | Winter
3rd row | Winter | Spring
4th row | Winter | Winter
5th row | Summer | Summer

Full Dataset
Value | Count | Frequency (%)
winter | 250307 | 25.0%
spring | 250169 | 25.0%
fall | 249839 | 25.0%
summer | 249685 | 25.0%

Stratified Sample
Value | Count | Frequency (%)
fall | 7567 | 25.2%
spring | 7515 | 25.1%
summer | 7499 | 25.0%
winter | 7419 | 24.7%

Most occurring characters

Full Dataset
Value | Count | Frequency (%)
r | 750161 | 13.6%
i | 500476 | 9.1%
n | 500476 | 9.1%
e | 499992 | 9.1%
S | 499854 | 9.1%
l | 499678 | 9.1%
m | 499370 | 9.1%
W | 250307 | 4.6%
t | 250307 | 4.6%
p | 250169 | 4.5%
Other values (4) | 999532 | 18.2%

Stratified Sample
Value | Count | Frequency (%)
r | 22433 | 13.6%
l | 15134 | 9.2%
S | 15014 | 9.1%
m | 14998 | 9.1%
i | 14934 | 9.1%
n | 14934 | 9.1%
e | 14918 | 9.0%
F | 7567 | 4.6%
a | 7567 | 4.6%
p | 7515 | 4.6%
Other values (4) | 29852 | 18.1%

Most occurring categories, scripts and blocks

All characters fall into a single category, script and block, each reported as (unknown): 5500322 characters (100.0%) in the Full Dataset and 164866 (100.0%) in the Stratified Sample. The most frequent characters per category, per script and per block are therefore identical to the "Most occurring characters" tables above.

weekend
Text

Statistic | Full Dataset | Stratified Sample
Distinct | 2 | 2
Distinct (%) | < 0.1% | < 0.1%
Missing | 0 | 0
Missing (%) | 0.0% | 0.0%
Memory size | 7.6 MiB | 234.5 KiB

Length

Statistic | Full Dataset | Stratified Sample
Max length | 3 | 3
Median length | 2 | 2
Mean length | 2.499333 | 2.498233333
Min length | 2 | 2

Characters and Unicode

Statistic | Full Dataset | Stratified Sample
Total characters | 2499333 | 74947
Distinct characters | 5 | 5
Distinct categories | 1 | 1
Distinct scripts | 1 | 1
Distinct blocks | 1 | 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Statistic | Full Dataset | Stratified Sample
Unique | 0 | 0
Unique (%) | 0.0% | 0.0%

Sample

Row | Full Dataset | Stratified Sample
1st row | Yes | Yes
2nd row | Yes | No
3rd row | Yes | Yes
4th row | No | No
5th row | Yes | No

Full Dataset
Value | Count | Frequency (%)
no | 500667 | 50.1%
yes | 499333 | 49.9%

Stratified Sample
Value | Count | Frequency (%)
no | 15053 | 50.2%
yes | 14947 | 49.8%

Most occurring characters

Full Dataset
Value | Count | Frequency (%)
N | 500667 | 20.0%
o | 500667 | 20.0%
Y | 499333 | 20.0%
e | 499333 | 20.0%
s | 499333 | 20.0%

Stratified Sample
Value | Count | Frequency (%)
N | 15053 | 20.1%
o | 15053 | 20.1%
Y | 14947 | 19.9%
e | 14947 | 19.9%
s | 14947 | 19.9%

Most occurring categories, scripts and blocks

All characters fall into a single category, script and block, each reported as (unknown): 2499333 characters (100.0%) in the Full Dataset and 74947 (100.0%) in the Stratified Sample. The most frequent characters per category, per script and per block are therefore identical to the "Most occurring characters" tables above.

customer_support_calls
Real number (ℝ)

Statistic | Full Dataset | Stratified Sample
Distinct | 20 | 20
Distinct (%) | < 0.1% | 0.1%
Missing | 0 | 0
Missing (%) | 0.0% | 0.0%
Infinite | 0 | 0
Infinite (%) | 0.0% | 0.0%
Mean | 9.496269 | 9.4736
Minimum | 0 | 0
Maximum | 19 | 19
Zeros | 49755 | 1525
Zeros (%) | 5.0% | 5.1%
Negative | 0 | 0
Negative (%) | 0.0% | 0.0%
Memory size | 7.6 MiB | 234.5 KiB

Quantile statistics

Statistic | Full Dataset | Stratified Sample
Minimum | 0 | 0
5-th percentile | 1 | 0
Q1 | 4 | 4
median | 9 | 9
Q3 | 14 | 14
95-th percentile | 18 | 19
Maximum | 19 | 19
Range | 19 | 19
Interquartile range (IQR) | 10 | 10

Descriptive statistics

Statistic | Full Dataset | Stratified Sample
Standard deviation | 5.761232791 | 5.769172022
Coefficient of variation (CV) | 0.606683824 | 0.608973571
Kurtosis | -1.204539564 | -1.203435083
Mean | 9.496269 | 9.4736
Median Absolute Deviation (MAD) | 5 | 5
Skewness | 0.001572025506 | 0.000146728086
Sum | 9496269 | 284208
Variance | 33.19180327 | 33.28334582
Monotonicity | Not monotonic | Not monotonic
Histogram with fixed size bins (bins=20)
Full Dataset
Value | Count | Frequency (%)
3 | 50608 | 5.1%
8 | 50350 | 5.0%
4 | 50334 | 5.0%
12 | 50312 | 5.0%
2 | 50158 | 5.0%
11 | 50151 | 5.0%
16 | 50087 | 5.0%
13 | 50074 | 5.0%
9 | 50053 | 5.0%
7 | 50050 | 5.0%
Other values (10) | 497823 | 49.8%

Stratified Sample
Value | Count | Frequency (%)
12 | 1606 | 5.4%
2 | 1577 | 5.3%
13 | 1543 | 5.1%
9 | 1535 | 5.1%
19 | 1531 | 5.1%
0 | 1525 | 5.1%
3 | 1517 | 5.1%
17 | 1502 | 5.0%
1 | 1501 | 5.0%
7 | 1500 | 5.0%
Other values (10) | 14663 | 48.9%

Smallest values (Full Dataset)
Value | Count | Frequency (%)
0 | 49755 | 5.0%
1 | 49530 | 5.0%
2 | 50158 | 5.0%
3 | 50608 | 5.1%
4 | 50334 | 5.0%

Smallest values (Stratified Sample)
Value | Count | Frequency (%)
0 | 1525 | 5.1%
1 | 1501 | 5.0%
2 | 1577 | 5.3%
3 | 1517 | 5.1%
4 | 1450 | 4.8%
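
A frequency percentage is only meaningful when a count is divided by the size of the dataset it was taken from. The sketch below recomputes the zero-call frequencies for customer_support_calls under that assumption; `frequency_pct` is a hypothetical helper, and the observation counts (1000000 and 30000) come from the report overview.

```python
def frequency_pct(count, n_observations):
    """Share of observations taking a value, as a percentage rounded to one decimal."""
    return round(100 * count / n_observations, 1)


N_FULL, N_SAMPLE = 1_000_000, 30_000   # observation counts from the report overview

# Number of rows with customer_support_calls == 0 in each dataset.
full_pct = frequency_pct(49755, N_FULL)      # Full Dataset
sample_pct = frequency_pct(1525, N_SAMPLE)   # Stratified Sample
print(full_pct, sample_pct)
```

Dividing a Full Dataset count by the sample size (or the reverse) yields impossible figures, such as percentages far above 100%, so any such value in a comparison table points to a mismatched denominator rather than to the data.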

email_subscriptions
Text

Statistic | Full Dataset | Stratified Sample
Distinct | 2 | 2
Distinct (%) | < 0.1% | < 0.1%
Missing | 0 | 0
Missing (%) | 0.0% | 0.0%
Memory size | 7.6 MiB | 234.5 KiB

Length

Statistic | Full Dataset | Stratified Sample
Max length | 3 | 3
Median length | 2 | 2
Mean length | 2.499938 | 2.498733333
Min length | 2 | 2

Characters and Unicode

Statistic | Full Dataset | Stratified Sample
Total characters | 2499938 | 74962
Distinct characters | 5 | 5
Distinct categories | 1 | 1
Distinct scripts | 1 | 1
Distinct blocks | 1 | 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Statistic | Full Dataset | Stratified Sample
Unique | 0 | 0
Unique (%) | 0.0% | 0.0%

Sample

Row | Full Dataset | Stratified Sample
1st row | No | No
2nd row | No | Yes
3rd row | Yes | No
4th row | No | Yes
5th row | No | Yes

Full Dataset
Value | Count | Frequency (%)
no | 500062 | 50.0%
yes | 499938 | 50.0%

Stratified Sample
Value | Count | Frequency (%)
no | 15038 | 50.1%
yes | 14962 | 49.9%

Most occurring characters

Full Dataset
Value | Count | Frequency (%)
N | 500062 | 20.0%
o | 500062 | 20.0%
Y | 499938 | 20.0%
e | 499938 | 20.0%
s | 499938 | 20.0%

Stratified Sample
Value | Count | Frequency (%)
N | 15038 | 20.1%
o | 15038 | 20.1%
Y | 14962 | 20.0%
e | 14962 | 20.0%
s | 14962 | 20.0%

Most occurring categories, scripts and blocks

All characters fall into a single category, script and block, each reported as (unknown): 2499938 characters (100.0%) in the Full Dataset and 74962 (100.0%) in the Stratified Sample. The most frequent characters per category, per script and per block are therefore identical to the "Most occurring characters" tables above.

app_usage
Text

Statistic | Full Dataset | Stratified Sample
Distinct | 3 | 3
Distinct (%) | < 0.1% | < 0.1%
Missing | 0 | 0
Missing (%) | 0.0% | 0.0%
Memory size | 7.6 MiB | 234.5 KiB

Length

Statistic | Full Dataset | Stratified Sample
Max length | 6 | 6
Median length | 4 | 4
Mean length | 4.334299 | 4.3237
Min length | 3 | 3

Characters and Unicode

Statistic | Full Dataset | Stratified Sample
Total characters | 4334299 | 129711
Distinct characters | 12 | 12
Distinct categories | 1 | 1
Distinct scripts | 1 | 1
Distinct blocks | 1 | 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Statistic | Full Dataset | Stratified Sample
Unique | 0 | 0
Unique (%) | 0.0% | 0.0%

Sample

Row | Full Dataset | Stratified Sample
1st row | High | Medium
2nd row | High | Medium
3rd row | Low | Medium
4th row | Low | Medium
5th row | Medium | Low

Full Dataset
Value | Count | Frequency (%)
medium | 333822 | 33.4%
low | 333345 | 33.3%
high | 332833 | 33.3%

Stratified Sample
Value | Count | Frequency (%)
low | 10143 | 33.8%
high | 9930 | 33.1%
medium | 9927 | 33.1%

Most occurring characters

Full Dataset
Value | Count | Frequency (%)
i | 666655 | 15.4%
M | 333822 | 7.7%
e | 333822 | 7.7%
d | 333822 | 7.7%
u | 333822 | 7.7%
m | 333822 | 7.7%
L | 333345 | 7.7%
o | 333345 | 7.7%
w | 333345 | 7.7%
H | 332833 | 7.7%
Other values (2) | 665666 | 15.4%

Stratified Sample
Value | Count | Frequency (%)
i | 19857 | 15.3%
L | 10143 | 7.8%
w | 10143 | 7.8%
o | 10143 | 7.8%
H | 9930 | 7.7%
g | 9930 | 7.7%
h | 9930 | 7.7%
M | 9927 | 7.7%
e | 9927 | 7.7%
d | 9927 | 7.7%
Other values (2) | 19854 | 15.3%

Most occurring categories, scripts and blocks

All characters fall into a single category, script and block, each reported as (unknown): 4334299 characters (100.0%) in the Full Dataset and 129711 (100.0%) in the Stratified Sample. The most frequent characters per category, per script and per block are therefore identical to the "Most occurring characters" tables above.

website_visits
Real number (ℝ)

                Full Dataset    Stratified Sample
Distinct        100             100
Distinct (%)    < 0.1%          0.3%
Missing         0               0
Missing (%)     0.0%            0.0%
Infinite        0               0
Infinite (%)    0.0%            0.0%
Mean            49.512951       49.5615
Minimum         0               0
Maximum         99              99
Zeros           10111           293
Zeros (%)       1.0%            1.0%
Negative        0               0
Negative (%)    0.0%            0.0%
Memory size     7.6 MiB         234.5 KiB

Quantile statistics

                             Full Dataset    Stratified Sample
Minimum                      0               0
5-th percentile              4               4
Q1                           25              25
Median                       50              50
Q3                           75              75
95-th percentile             95              94
Maximum                      99              99
Range                        99              99
Interquartile range (IQR)    50              50

Descriptive statistics

                                   Full Dataset        Stratified Sample
Standard deviation                 28.86977699         28.77939123
Coefficient of variation (CV)      0.5830752644        0.5806803916
Kurtosis                           -1.199464505        -1.192315994
Mean                               49.512951           49.5615
Median Absolute Deviation (MAD)    25                  25
Skewness                           -0.0006306812576    -0.00604114859
Sum                                49512951            1486845
Variance                           833.4640237         828.2533595
Monotonicity                       Not monotonic       Not monotonic
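
The derived rows in the table above can be cross-checked against one another. The following is a minimal Python sketch, assuming the usual definitions (CV is the standard deviation divided by the mean, variance is the squared standard deviation, and sum is the mean times the number of observations). It uses the Full Dataset values for website_visits; the variable names are illustrative and are not part of the report.

```python
import math

# Full Dataset values for website_visits, copied from the report.
n_observations = 1_000_000
mean = 49.512951
std_dev = 28.86977699

cv = std_dev / mean            # coefficient of variation
variance = std_dev ** 2        # variance as the squared standard deviation
total = mean * n_observations  # sum of all values

print(f"CV       = {cv:.6f}")        # value listed in the report: 0.5830752644
print(f"Variance = {variance:.4f}")  # value listed in the report: 833.4640237
print(f"Sum      = {total:.0f}")     # value listed in the report: 49512951
```

Small differences in the last digits are expected, because the report prints rounded values.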
Histogram with fixed size bins (bins=50)
Full Dataset
Value                Count     Frequency (%)
58                   10304     1.0%
95                   10250     1.0%
50                   10235     1.0%
62                   10177     1.0%
45                   10175     1.0%
13                   10166     1.0%
38                   10160     1.0%
84                   10147     1.0%
98                   10136     1.0%
93                   10132     1.0%
Other values (90)    898118    89.8%

Stratified Sample
Value                Count    Frequency (%)
38                   331      1.1%
52                   329      1.1%
56                   328      1.1%
45                   328      1.1%
53                   326      1.1%
14                   324      1.1%
58                   324      1.1%
3                    323      1.1%
42                   323      1.1%
96                   322      1.1%
Other values (90)    26742    89.1%
ValueCountFrequency (%)
0 10111
1.0%
1 9997
1.0%
2 9933
1.0%
3 10007
1.0%
4 9969
1.0%
ValueCountFrequency (%)
0 293
1.0%
1 304
1.0%
2 301
1.0%
3 323
1.1%
4 280
0.9%
ValueCountFrequency (%)
0 293
< 0.1%
1 304
< 0.1%
2 301
< 0.1%
3 323
< 0.1%
4 280
< 0.1%
ValueCountFrequency (%)
0 10111
33.7%
1 9997
33.3%
2 9933
33.1%
3 10007
33.4%
4 9969
33.2%

social_media_engagement
Text

                Full Dataset    Stratified Sample
Distinct        3               3
Distinct (%)    < 0.1%          < 0.1%
Missing         0               0
Missing (%)     0.0%            0.0%
Memory size     7.6 MiB         234.5 KiB

Length

                 Full Dataset    Stratified Sample
Max length       6               6
Median length    4               4
Mean length      4.332057        4.325733333
Min length       3               3

Characters and Unicode

                       Full Dataset    Stratified Sample
Total characters       4332057         129772
Distinct characters    12              12
Distinct categories    1               1
Distinct scripts       1               1
Distinct blocks        1               1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
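
As an illustration of what such character properties look like, here is a minimal sketch that uses Python's standard `unicodedata` module to look up the general category and name of each distinct character in the three values shown in the sample rows. This is an assumption made for illustration only. It is not necessarily how the report derives its categories, scripts, or blocks, and `unicodedata` does not expose script or block information.

```python
import unicodedata

# The three distinct values of this variable, capitalised as in the sample rows.
values = ["High", "Medium", "Low"]

# Collect every distinct character that occurs in any of the values.
distinct_chars = sorted(set("".join(values)))
print(len(distinct_chars), "distinct characters")

for ch in distinct_chars:
    # category() returns the two-letter general category (for example "Lu" or "Ll");
    # name() returns the official Unicode character name.
    print(repr(ch), unicodedata.category(ch), unicodedata.name(ch))
```

The number of distinct characters found this way agrees with the "Distinct characters" row reported for this variable.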

Unique

              Full Dataset    Stratified Sample
Unique        0               0
Unique (%)    0.0%            0.0%

Sample

           Full Dataset    Stratified Sample
1st row    High            Medium
2nd row    Medium          Medium
3rd row    Medium          Medium
4th row    Low             High
5th row    Low             Medium

Full Dataset
Value     Count     Frequency (%)
low       334073    33.4%
medium    333065    33.3%
high      332862    33.3%

Stratified Sample
Value     Count    Frequency (%)
low       10084    33.6%
high      9988     33.3%
medium    9928     33.1%
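
The value counts above are consistent with the length and character statistics reported for social_media_engagement. The sketch below recomputes the total character count, the mean length, and the count of the letter "i" from the Full Dataset counts. It assumes each value is stored with the capitalisation shown in the sample rows; the dictionary and variable names are illustrative.

```python
# Value counts for social_media_engagement (Full Dataset), copied from the report.
# Capitalisation follows the sample rows ("High", "Medium", "Low").
counts = {"Low": 334073, "Medium": 333065, "High": 332862}

n_rows = sum(counts.values())
# Each value contributes its length once per occurrence.
total_chars = sum(len(value) * count for value, count in counts.items())
mean_length = total_chars / n_rows
# "i" occurs once in "High" and once in "Medium", and not at all in "Low".
i_count = sum(value.count("i") * count for value, count in counts.items())

print(n_rows, total_chars, mean_length, i_count)
```

The results can be compared with the "Total characters" and "Mean length" rows and with the count listed for "i" under "Most occurring characters".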

Most occurring characters

ValueCountFrequency (%)
i 665927
15.4%
L 334073
7.7%
w 334073
7.7%
o 334073
7.7%
M 333065
7.7%
e 333065
7.7%
d 333065
7.7%
u 333065
7.7%
m 333065
7.7%
H 332862
7.7%
Other values (2) 665724
15.4%
ValueCountFrequency (%)
i 19916
15.3%
L 10084
7.8%
w 10084
7.8%
o 10084
7.8%
H 9988
7.7%
g 9988
7.7%
h 9988
7.7%
M 9928
7.7%
e 9928
7.7%
d 9928
7.7%
Other values (2) 19856
15.3%

Most occurring categories

ValueCountFrequency (%)
(unknown) 4332057
100.0%
ValueCountFrequency (%)
(unknown) 129772
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
i 665927
15.4%
L 334073
7.7%
w 334073
7.7%
o 334073
7.7%
M 333065
7.7%
e 333065
7.7%
d 333065
7.7%
u 333065
7.7%
m 333065
7.7%
H 332862
7.7%
Other values (2) 665724
15.4%
ValueCountFrequency (%)
i 19916
15.3%
L 10084
7.8%
w 10084
7.8%
o 10084
7.8%
H 9988
7.7%
g 9988
7.7%
h 9988
7.7%
M 9928
7.7%
e 9928
7.7%
d 9928
7.7%
Other values (2) 19856
15.3%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 4332057
100.0%
ValueCountFrequency (%)
(unknown) 129772
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
i 665927
15.4%
L 334073
7.7%
w 334073
7.7%
o 334073
7.7%
M 333065
7.7%
e 333065
7.7%
d 333065
7.7%
u 333065
7.7%
m 333065
7.7%
H 332862
7.7%
Other values (2) 665724
15.4%
ValueCountFrequency (%)
i 19916
15.3%
L 10084
7.8%
w 10084
7.8%
o 10084
7.8%
H 9988
7.7%
g 9988
7.7%
h 9988
7.7%
M 9928
7.7%
e 9928
7.7%
d 9928
7.7%
Other values (2) 19856
15.3%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 4332057
100.0%
ValueCountFrequency (%)
(unknown) 129772
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
i 665927
15.4%
L 334073
7.7%
w 334073
7.7%
o 334073
7.7%
M 333065
7.7%
e 333065
7.7%
d 333065
7.7%
u 333065
7.7%
m 333065
7.7%
H 332862
7.7%
Other values (2) 665724
15.4%
ValueCountFrequency (%)
i 19916
15.3%
L 10084
7.8%
w 10084
7.8%
o 10084
7.8%
H 9988
7.7%
g 9988
7.7%
h 9988
7.7%
M 9928
7.7%
e 9928
7.7%
d 9928
7.7%
Other values (2) 19856
15.3%

days_since_last_purchase
Real number (ℝ)

                Full Dataset    Stratified Sample
Distinct        365             365
Distinct (%)    < 0.1%          1.2%
Missing         0               0
Missing (%)     0.0%            0.0%
Infinite        0               0
Infinite (%)    0.0%            0.0%
Mean            182.027559      182.3302
Minimum         0               0
Maximum         364             364
Zeros           2768            85
Zeros (%)       0.3%            0.3%
Negative        0               0
Negative (%)    0.0%            0.0%
Memory size     7.6 MiB         234.5 KiB

Quantile statistics

                             Full Dataset    Stratified Sample
Minimum                      0               0
5-th percentile              18              19
Q1                           91              92
Median                       182             182
Q3                           273             274
95-th percentile             346             346
Maximum                      364             364
Range                        364             364
Interquartile range (IQR)    182             182

Descriptive statistics

                                   Full Dataset        Stratified Sample
Standard deviation                 105.3645979         105.2194266
Coefficient of variation (CV)      0.5788387123        0.5770817266
Kurtosis                           -1.199912738        -1.196932124
Mean                               182.027559          182.3302
Median Absolute Deviation (MAD)    91                  91
Skewness                           -0.0005543132091    0.0001539046339
Sum                                182027559           5469906
Variance                           11101.69848         11071.12774
Monotonicity                       Not monotonic       Not monotonic
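
The statistics above are close to what a uniform distribution over the integers 0 to 364 would give. That is an observation about the numbers, not a statement about how the data were generated. The sketch below evaluates the standard closed forms for a discrete uniform distribution on n consecutive integers, assuming the report's kurtosis is excess kurtosis; all names are illustrative.

```python
import math

a, b = 0, 364   # minimum and maximum from the report
n = b - a + 1   # number of equally likely integer values

uniform_mean = (a + b) / 2
uniform_variance = (n**2 - 1) / 12
uniform_std = math.sqrt(uniform_variance)
uniform_excess_kurtosis = -6 * (n**2 + 1) / (5 * (n**2 - 1))

# Compare with the Full Dataset column of the report:
# mean 182.027559, variance 11101.69848,
# standard deviation 105.3645979, kurtosis -1.199912738.
print(uniform_mean, uniform_variance, uniform_std, uniform_excess_kurtosis)
```

The near-zero skewness and the quartiles at roughly 91, 182, and 273 point the same way.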
Histogram with fixed size bins (bins=50)
Full Dataset
Value                 Count     Frequency (%)
53                    2916      0.3%
72                    2890      0.3%
98                    2888      0.3%
252                   2869      0.3%
364                   2867      0.3%
6                     2862      0.3%
325                   2857      0.3%
136                   2843      0.3%
267                   2833      0.3%
239                   2832      0.3%
Other values (355)    971343    97.1%

Stratified Sample
Value                 Count    Frequency (%)
114                   107      0.4%
26                    106      0.4%
49                    103      0.3%
163                   103      0.3%
216                   101      0.3%
203                   101      0.3%
329                   100      0.3%
109                   100      0.3%
296                   99       0.3%
337                   99       0.3%
Other values (355)    28981    96.6%
ValueCountFrequency (%)
0 2768
0.3%
1 2752
0.3%
2 2701
0.3%
3 2709
0.3%
4 2786
0.3%
ValueCountFrequency (%)
0 85
0.3%
1 63
0.2%
2 83
0.3%
3 85
0.3%
4 79
0.3%
ValueCountFrequency (%)
0 85
< 0.1%
1 63
< 0.1%
2 83
< 0.1%
3 85
< 0.1%
4 79
< 0.1%
ValueCountFrequency (%)
0 2768
9.2%
1 2752
9.2%
2 2701
9.0%
3 2709
9.0%
4 2786
9.3%